marketing force in providing service to 
customers. 


McConnell is a graduate of the Uni- 
versity of Washington with a B.A. degree 
in mathematics. While attending college, 
he was a student actuary at Milliman and 
Robertson, an actuarial consulting firm. 
Before coming to STSC in December, he was 
responsible for development of opera- 
tional, accounting, and management report- 
ing systems for the Group Division of the 
American National Insurance Company. 


A new method for deleting lines from 
a function when in function definition 
mode has been implemented on the APL*PLUS 
System. The new method allows deletion of 
multiple lines in a single command, and it 
is terminal independent. A user can spec- 
ify the deletion of any number of lines in 
the following way: To delete lines num- 
bered N1, N2, N3, ..., use the form 


[L~V¥1 N2 N3 ...3 


For example, [~33 specifies the deletion 
of line 3, and [~2 5 7.1] specifies the 
deletion of lines 2, 5, and 7.1. 


Fractional line numbers can be used, 
but zero cannot. Line numbers can be 
Specified in any order, but they must be 
separated by one or more spaces. 


If any of the specified line numbers 
do not exist in the function, the report 
NOT FOUND: is given, followed by a list of 
the nonexistent line numbers. If other 
line numbers specified in the same dele- 
tion request are found, they are deleted. 
For example, if a user is defining a 
function that consists of line numbers 1 
through 10 and 7.6, and he enters the line 


[3] en"? 64 t2: Soot] 
the system responds with 


NOT FOUND: 12 5.1 
[11] 


and only lines 7.6 and 5 are deleted. 


Upon completion of a line deletion 
command, the system displays a line number 
prompt which is the last line number in 
the function plus its normal increment. 
For example, if the last line number in 
the function is 10, the prompt is [11]; if 
the last line number is 8.2, the prompt is 
[8.3]. Consequently, [~] can be used to 
transfer to the next largest line number 
beyond the last one in the function. 


As with other function editing 
commands, the user can enter and/or exit 
function definition mode on the same line 
as a line deletion command. For example, 


VFOOL~4 3 8] 
[11] 


CI Q D ty 


Ui 


U JN ZPaA 


Jy N Oh 


or 
[3] Cmi 3 8y 

Or 

VFOO[~4 3 8]V 


Also, as with other function editing com- 
mands, the line deletion command must be 
the rightmost command entered on the line. 


WHIZBANG OF THE MONTH 
by Roy A. Sykes, Jr. 


BRANCHING AND ITERATION 


This article discusses some guide- 
lines and techniques for branching and 
iteration in APL. Many people have asked 
me, "What's the fastest way to branch?" 

I hope to shed some light on this often- 
maligned subject. 


Guidelines for Branching 


The following three guidelines will 
Simplify the maintenance of your code and 
improve its readability. The techniques 
you use are not as important as how con- 
sistently you use them. 


l. Branch to labels, not line numbers. 

If your branch targets are line numbers 
rather than labels, simply inserting or 
deleting lines from a program can become a 
heinous task. Furthermore, minor benefits 
in storage and execution speed are gained 
by using label variables rather than con- 
stants. 


2. Adopt a consistent notation for 
labels. Labels like LOOP, AGAIN, DO, 
MORE, and OOPS seldom add clarity to pro- 
grams, especially large programs. Fur- 
thermore, they give the reader little or 
no clue as to where in the function they 
refer, or whether they are in fact labels. 
Common notations are {L1;22;L13;...} and 
{A;B;C;3...}. More mnemonic conventions 
can help the reader recognize program 
segments (for example, LP1 or FR2). The 
notation or convention that you choose is 
irrelevant as long as you adhere to it. 


3. Adopt consistent techniques for 
branching and iteration. For instance, 
some people favor 


+LABELxi1condition 
while others use 

+( condition) pLABEL 

Again, it's not so important which 
techniques you use (except for trivial 
differences in execution speed), but that 


you use them consistently. Among my per- 
sonal favorites are: 


+(condition)pLABEL or +(condition)+LABEL 
+(xpvector )pLABEL or +(pvector )+LABEL 


+(conditions)/LABEL or +(A/conditions) pLABEL 


+LABELxcondition or >LABELx~condition 


Programs can often be made more read- 
able by using functions with mnemonic 
names for branch calculations. For 
instance, 


© *THISL IF M2C<C+1 
is certainly more understandable than 
© +(M2C<C+1)40LC 


Or 


H3 


© >T[ıM2C+C+1 


You can define the functions IF and 
THISL as: 


V R+A IF B V R<«THISL 
C1] R<«B/A [1] R+1414+0LC 
y y 


IF can also be used for multiple 
conditions. For instance, 


+(Z4,L5) IF 1 1=xA 


will branch to L4 if A is negative, L5 if 
A is positive, and fall through if 4A is 
Zero. 


These functions are especially bene- 
ficial in programs with complex branching 
logic and many transfers of control. How- 
ever, in Simple iteration (see below) they 
exact a small but noticeable penalty in 
CPU resources. 


2. Faster techniques for iteration. 


The following techniques, while quite 
fast, generally detract from readability 
and should therefore be used judiciously. 


Typically, loops involve a leading 
test as exemplified by ITERATE. 


V ITERATE M;C 

[1] SETUP O C+1 

[2] +0 IF C>M 0 PROCESS 9 C+«Ct+1 0 +ULC 
y 


We can produce a speedier version of 
ITERATE by eliminating the IF subroutine 
and using a label. 


V ITERATE2 M;C 

[1] SETUP © C+1 

[2] A:*+(C>M)p0 © PROCESS © C+C¥+1 © >A 
y 


By employing a trailing test, the 
branch overhead can be reduced signifi- 
cantly. The once-executed leading test, 
+M¥0, accommodates the zero case. 


V ITERATE3 M;C 

[1] SETUP © +M+0 © C¥1 

[2] A:PROCESS © +(M2C+C+1)pA 
V 


Further, by precalculating all branch 
points, we can avoid the serial iteration- 
by-iteration testing. APL permits (and 
even encourages) parallel processing on 
data -- why not parallel calculations, 
testing, and branch-point determination? 
ITERATE4 illustrates a simplified form of 
this technique. 


V ITERATE M3C3L 
[1] SETUP © C+1 © +L«(MpA4) ,0 
[2] A:PROCESS 9 +L([C+C+1) 

V 


Note that all of the above algorithms 
correctly handle the case where zero iter- 
ations are requested. They are also re- 
startable; that is, if PROCESS produces a 
WS FULL or VALUE ERROR, the problem can be 
corrected, and *ULC will resume execution 
normally. It is true that ITERATE2 could 
have been shortened to 


C1] SETUP © C+0 
[2] A:>+(M<C+«C+1)p0 © PROCESS © +A 


However, resumption after a VALUE ERROR on 
PROCESS would have incremented C twice 
(once before the error, and once after the 
restart), thereby skipping one iteration. 


The following table illustrates the 
relative CPU timings of the above tech- 
niques with varying numbers of iterations. 
The ratios are to ITERATE4; SETUP and 
PROCESS were set as empty matrices. 

Iterations (M) 


eek semue anes MESE Aree 
ITERATE PiU OER 876 seegers. 97 
ITERATE? e0 4.08% 10454 17E Ee reee 
ITERATE3 65 .92 1.23 1.40 1.44 
ITERATE 1.00 1.00 1.00 1.00 1.00 


(Executed 11/29/76 at 7:27 P.M. EST 
on APLPLUSC with 20 users) 


ae) 


3. 2 Recursed! 


Did you know that you can use the 
execute function (2L) to effect iteration 
in desk calculator mode? Assume M is the 
number of iterations (M21), and C is the 
counter (initialized as C+1). Now compare 
the following two lines -- one in a func- 
tion, the other in desk calculator mode: 


[V] LDL: PROCESS © +(M2C+C+1)/L 
2L+'"PROCESS © #2(M2C+«C+1)/L' 


Both have the effect of executing PROCESS 
as many times as called for by M. Observe 
that whereas the first line uses branching 
to the label L, the second uses e to 
recursively execute the statement repre- 
sented by Z. Unpleasantly, the execute 
construction consumes significantly more 
CPU time and is far more extravagant in 
its use of workspace storage. It does, 
however, illustrate the interesting rela- 
tionship between > and 2. 


translated to an ASCII "BEL" control code; 
this results in a "beep" or a bell. On 
Selectric(R)-based terminals, the bell 
character prints as a space followed by an 
upshift and a backspace; this causes the 
typesphere to "twitch". During input, the 
"BEL" control codes (control G) received 
from an APL-ASCII or ASCII-subset terminal 
are ignored. 


The niladic system function UTCBEL 
returns an explicit result containing the 
bell character as a character scalar. 


Note that on some terminals the sound 
produced by the "BEL" control code will 
last more than one character-time (1/30th 
of a second at 30 cps); thus, several bell 
characters may need to be separated by one 
Or more null characters (O7CNUL) in order 
to be heard as distinct sounds. 


WHIZBANG OF THE MONTH 
by Roy A. Sykes, Jr. 
STRING SEARCHING 


The ability to rapidly locate string 
occurrences in other strings is often 
useful. The many techniques applicable to 
the problem also provide enlightening 
examples of efficient problem-solving in 
APL. This article discusses several 
algorithms for string searching to illus- 
trate efficiency of execution and some 
interesting techniques. 


A string is a vector, typically 
character. String searching means locat- 
ing the occurrences of one string, called 
the substring, in another string, called 
the target string. The result can be a 
Boolean vector of the same length as the 
target string with 1's denoting the 
leading element of each match. More 
commonly (see each function below), the 
result is the origin-sensitive indices of 
the 1's in the Boolean vector (i.e., 
BV/ipBV). Overlapping matches are possi- 
ble. In all functions, the left argument 
A is the target string and the right 
argument B is the substring for which we 
will search. 


The most fundamental APL solution is 
quite straightforward: 


V ReA SS1 BCDE 
[1] C+pA+yA © D€pB+, B© E«O0fC-Ol[ D-1 
C2] Re (E+a/((1D)-11)0A°.=B)/1E 
[3] a OR ALTERNATIVELY, 
C4] aR+(E4+A4((1D)-11)0B°.=A4)/1EF 
y 


All characters in the target string are 
compared with all characters in the 
substring, the logical matrix is shifted 
to align each potential occurrence, and 
the indices of complete matches are 
selected. Notice that if the length of 
the substring B is greater than that of 
the target string A, then no matches are 
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possible. Furthermore, if the substring 
is empty, all indices are returned. The 
number of potential matches, E, is nor- 
mally 1 plus the difference in lengths of 
the strings, that is, E+«1i+C-D or E+«C-D-1. 
The two Of 's are to correctly handle empty 
Or overlong substrings. 


A somewhat more elegant solution 

employs inner product to detect matches: 

V RA SS2 B3C3D3E 
Lig C+pA+,A © D+epB+,B © E+«0fC-Of D-1 
[2] Re (E+BA.=(D,C+1)pA)/iE 
C3] a OR ALTERNATIVELY, 
[4] aR«(E+Ba.=((1D)-11)0(D,C)pA)/iE 

V 


Rather than rotating a logical array, this 
technique builds a matrix of all possible 
matches and then uses A.= to select those 
that are complete. Observe the pleasant 
effect of the reshape on line [2] -- it 
both builds and shifts the matrix. S52 is 
somewhat faster than SS1, although it 
requires more workspace area (UWA) in 
which to execute. 


Unfortunately, both of these algo- 
rithms are inherently inefficient because 
they must compare all possible combina- 
tions of characters in the two strings; 
that is, they are "blind", and continue to 
process even if no matches are possible. 

A somewhat better approach is to determine 
the indices of the first character, and 
compare only the remaining candidates with 
the rest of the string: 


V RA SS3 B;3C3D;3E 
C1] C«pA+,A © D+pB+,B © +BLAtD 
[2] A:ReiC © 70 A EMPTY STRING 
L3] R«(AeB)/1C Q >0 a SINGLETON STRING 
C4] B:E«0[C-D-1 0 +(pR+(EtAe1t+B)/1F)+0 
[5] R+(ACRo.+( 1*11)41D]A.=14B)/R 

y 


This function employs some new techniques 
worth discussing. Notice the unusual 
branch on line [1] which has the effect of 
trapping both the empty and one-element 
cases. If desired, line [2] could then be 
modified to return ı0 in the empty case 
rather than all indices -- a more usable 
result. 


Line [4] calculates the number of 
potential matches, and locates the occur- 
rences of the first character. (For 
character arguments, e is typically faster 
than = because of the internal algorithm 
used.) If the first character is not 
found, the function exits immediately with 
an empty result. Otherwise, a matrix of 
all remaining candidates is built via 
outer product and subscription, and A.= is 
used to produce the compression vector 
which selects only complete matches. The 
("1*11)+41D construct cleverly accounts for 
the index origin while removing the index 
of the character already examined. 


While SS3 is a considerable improve- 
ment over SS1 and S$S2, it is still rela- 
tively naive. For instance, if the first 
character of the substring occurs fre- 


quently in the target string, little or no 
Savings may accrue. Furthermore, the risk 
of a WS FULL is increased because of the 
larger space required by the matrix of 
indices. A more sophisticated algorithm 
employs basically the same technique, but 
attempts to minimize the number of initial 
"hits" by searching for the character in 
the substring that has the least number of 
expected occurrences: 


V RA SSH BsCsDsE3F3G 
C1] C+pA+,A © D+pB+,B © +BLAtD 
[2] A:R+«1C 9 70 aA EMPTY STRING 
L3] R+(AeB)/1C © >0 a SINGLETON STRING 
Cul] B:F+' ETAISONRLCFHUDMPBWYGKVJQXZ.,' 
[5] E+0fC-D-1 © G+FıB © F+«Gil/G 
[6] >+(pR+«(E+(F-11)+AeBLF])/1i1F)+0 
E7) G+ (F41D)/iD 
[8] R«(ACR°.+G-11]A.=B[G])/R 
y 


Line [4] specifies a letter distribution 
vector arranged in descending order of 
expected frequency in normal text. By 
line [6], F has become the index in the 
substring of the letter with the fewest 
probable occurrences in the target string. 
This letter is used for the first compari- 
son. Its index is then removed on line 
[7] for the outer product calculation of 
indices and A.= comparison on line [8]. 


Since we have concluded that reducing 
the number of potential hits on one 
character is beneficial, why not do it on 
two or three? In fact, why not conclude 
the entire process by constantly refining 
the potential matches until none remain, 
Or until we have exhausted the substring? 
SS5 does just that: 


V RA SSS BCDE 
C1] C+pA+,A © D+pB+,B © >BLA+D 
[2] A:R+ıC O 70 a EMPTY STRING 
L3] R«(AeB)/1i1C O +0 a SINGLETON STRING 
[4] B:E+«O0fC-D-1 © +(pR+(F+Ae14+B)/1F)+0 
[5] B+ (ıC+1)+B 
C6] C€:+(pR+(ALR+CJ]eBLC])/R)+0 
pr] +CxD>C+C+1 
y 


Through line [4] the function is 
identical to 553; that is, both empty and 
one-element strings are accommodated, and 
the first character of longer strings has 
been compared. Line [5] initializes the 
substring element counter, and modifies B 
so that the counter is appropriate in 
either origin. Lines [6] and [7] compose 
the loop; on line [6] the Cth character 
beyond existing matches in the target 
string is compared with the Cth character 
of the remaining substring. If no matches 
remain, the function terminates. Other- 
wise, the counter on line [7] is incre- 
mented and the function branches back to 
line [6] if there are more characters to 
compare in the substring. 


In most cases, SS5 is highly effi- 
cient. It slows down somewhat if the 
Substring is quite long because it must 
iterate on each element. On the other 
hand, its antecedents are all dangerously 
prone to WS FULL problems in similar 
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Situations, especially if the target 
String is also long. However, WS FULL 
errors are still possible in SS5, particu- 
larly if the first character of the 
Substring occurs often in the target 
String. This situation can be remedied in 
much the same way as was illustrated in 
the progression from SS3 to SS4: 


V ReA SS6 BsCsDsE:F3G . 
[1] C+pA+,A © D«pB+,B © >BLA+D 
[2] A:R«iC © +0 aA EMPTY STRING 
[3] R«(AeB)/1C © +0 a SINGLETON STRING 
[4] B:F«' ETAISONRLCFHUDMPBWYGKVJIQXZ.,' 
[5] E+0[C-D-1 © B+B(G+VFiB] 
[6] G+G-F+iC+1 
C7] +(pR+(E+GLFIVAeEBLF])/1£)+0 
[8] G+FYG © B+FYB 
C9] C:>(pR+-(ACLR+GCC]]eBCC])/R)+0 
[10] +CxD>C<+C+1 
y 


An expected letter distribution vector is 
used to rearrange the elements of the 
substring so that the fewest possible 
matches are likely for each iteration. 
Besides reducing the expected storage 
requirements, this technique normally 
speeds up the search further because fewer 
comparisons need be performed. 


Some conclusions can be drawn from 
the above discussion: 


o the shortest solution is not necessar- 
ily the fastest; 


o handling special cases as such is 
often beneficial; 


o iteration can be faster than "closed- 
form" code; and 


o intelligent analysis of an algorithm 
can yield substantial gains in CRU 
efficiency and. workspace conservation. 


In case you're dubious, the following 
timing comparisons should dispel any 
doubts. The target string is the variable 
INSTRUCTIONS (3683 characters) from 
workspace 1 FILEAID on the APL*PLUS 
System. The times are in CPU millisec- 
onds; the origin is 1. 


B oR 

SUBSTRING HITS SS1 SS2 S83 SS4 SSS SS6 
ea 3683 19 19 2 2 2 2 
r 3 783 85 60 8 8 8 8 
a 0 82 56 2 2 2 2 
: r 342 164 -105 2). 20- 15 T4 
'AN' 33- 163 -404 10 9 7 7 
‘aw! 0 162- 703 3 4 3 4 
" THE' 20 < 262 ° 204 55 9 14 8 
'THE * 16 261 203 14 9 9 9 
'MATRIX' 5 257 300 15 5 10 9 
"RATMIX' O27cesee +301 23 5 9 5 
" PERMITS * 191-056 2403 Lit13 -17 28:48 
"PERMITS © 15 1 °O55 Enot DEI 177 TFt 
'NOTFOUND' 0 454 £399 227° 239 7 7 
"ae10OUp~w' @» 55-399 3 4 4 4 


(Executed 2/25/77 at 9:19 P.M. EST 
on APLPLUSC/470-10011 with 8 users) 


MANAGER OF OPERATING SYSTEMS 


Stuart Bell recently joined STSC as 
Manager of Operating Systems, a new 
position in the company. His job in the 
Systems Department is to select, generate, 
and maintain operating systems which will 
Support future enhancements to the 
APL*PLUS System. 


Bell was previously a Computer 
Scientist at Computer Sciences Corpora- 
tion. He holds a B.S. degree in physics 
from Drexel University. 


NEW APL*PLUS SYSTEM COMMAND 
SETS USER'S INTERACTION LIMIT 


The new APL*PLUS System command 
)INTLIM can be used to set a user's 
interaction limit. The interaction limit 
is a limit on the amount of computer 
processing time that can be consumed 
between inputs (carriage returns) from the 
terminal. When an interaction limit is in 
effect, execution is halted if it consumes 
more computer time than specified by the 
Limit. 


The )INTLIM system command is most 
useful during the development and testing 
of a program. It provides an automatic 
way to interrupt a program, and can help 
reduce the computer time wasted by a 
runaway program, such as one containing an 
unexpected program loop. This command is 
normally not used when running a produc- 
tion program. 


More details concerning the )IWTLIM 
command are contained in news item 318 in 
workspace 1 NEWS on the APL*PLUS System. 


WHIZBANG OF THE MONTH 
by Roy A. Sykes, Jr. 


USING BOOLEANS TO CONTROL COMPUTATION 


One of APL's unique contributions to 
computing languages is its ability to 
generate and manipulate Boolean arrays 
(those containing only i's and 0's). 
Although binary (or bit) datatype is 
available in many other languages, it is 
often clumsy to use, requiring conversion 
and indirect manipulation. APL, on the 
other hand, handles Boolean data directly 
and efficiently; the data is stored 
compactly (one value per bit) and pro- 
cessed rapidly. 


Boolean values are most often gener- 
ated by one of the six relational func- 
tions (< < = 2 > #) or by epsilon (e€). 
Five other functions perform logical 
calculations: and (A), or (Y), nand (x), 
nor (*), not (~). Together, these provide 
all the nontrivial functions of logical 
calculus. For instance, exclusive-or is z 
and logical implication is <. 
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But APL's strength with Boolean 
values lies in its ability to use them 
with every primitive function that allows 
numeric arguments. Some functions, such 
as compression (/) and expansion (\), are 
designed specifically for Boolean argu- 
ments; these allow us to select from, and 
insert values into, arrays. Other func- 
tions for which Boolean arguments are most 
useful are rotate (%), grade up (Å), grade 
down (¥), decode (1), and several scalar 
dyadic functions (+ - x 4 | !). The 
reduction and scan operators also gain new 
Significance with Boolean arguments. 


Most uses of Boolean values deal with 
representing logical true-false condi- 
tions. For instance, 


V+?14914 © V,[0.5] V>s8 
8 711 5 912 8 3 6 1144 5 7 13 
002i 00 i iù 0o 0 6 0 4 6 o 7 


V>8 generates a Boolean vector (or bit 
vector) that asserts whether the corre- 
sponding element of V is greater than 8; 
the result contains a 1 if the assertion 
is true and a 0 if it is false. We can 
use this result in a variety of ways: 


. +/V>8 How many greater than 8? 
i A/V>8 All greater than 8? 
i v/V>8 Any greater than 8? 

<\V>8 The first greater than 8. 


0010000000000 0 


(V>8)i1 What is its index? 


(V>8)/V 
a1. 9 22? 28-35 


All elements greater 
than 8. 


(V>8)/ipV Indices of all ele- 
3 5 6 11,14 ments greater than 8. 


The entry VLYV>8] moves all elements 
greater than 8 to the front of the vector: 


VCYV>8] 
11 9 32.14 13 8758.3 6-1 5.7 


Boolean values can also represent 
values in binary representation. The 
following expression gives the base-2 
representation of 165 in 8 bits: 

2 2 
10100 


We can convert back to the base-10 value: 


24.4.0 14.0 0 1,01 
165 


With some imagination, we can also 
use Boolean values to control other 
calculations. A common operation is 
multiplication by a bit vector. For 
instance, assume we have a matrix REGS of 
sales region names and an associated 
vector NACC indicating how many new 
accounts each region has generated. We'd 
like to produce a new-account report. 


DISP+NACC>0 Show only regions 
with new accounts. 
NA+NACCXWACC>1 Show the number only 


if it's more than 1. 


TM E ai. 


DISP#REGS,'PU (DRAM) OBLI6' OFMT NA 
NEW ENGLAND (3) 
MIDDLE ATLANTIC (10) 
EAST NORTH CENTRAL 
EAST SOUTH CENTRAL (5) 
WEST SOUTH CENTRAL 
PACIFIC (2) 


fo») 


(~DISP)#REGS 
SOUTH ATLANTIC 
WEST NORTH CENTRAL 
MOUNTAIN 


By using logical multiplication, we 
"zeroed out" WA for regions with only one 
new account, and used the B modifier to 
Suppress display. 


Another application is to return 0 
rather than 1 for 0+0: 


Here we used exponentiation by a logical 
vector to change 0's into 1's because 01 
will produce the desired 0. We could also 
have used A+#B+B=0 in this case. 


The following table (the first two 
rows were illustrated above) shows the 
most useful manipulations and what they 
produce when BOOLEAN is 0 or 1. 


BOOLEAN<0 BOOLEAN<+1 


Operation 
AXBOOLEAN 
A*BOOLEAN 
BOOLEAN! A 
BOOLEAN|A 
Ax~BOOLEAN 


Ax~BOOLEAN 


( #) 


m wh BRE FO 
FPooxs*x B& 


(#) Only for integer A. 


Often when dealing with arrays, we 
wish to process part of the array while 
leaving the rest untouched. For instance, 
add 3 percent to only those bills over 30 
days old: 


BILLS*BILLIS*1.03*DAYSOLD>30 


Other times, we may wish to alter the 
order of the arguments of the computation 
itself based on a conditional array (e.g., 
A-B or B-A). For functions with an 
inverse, we may wish to conditionally 
alter the function used in a computation 
(e.g., A+B or A-B). If a function has an 
inverse, we can use 1 and one of x*|! on 
the conditional array to give us the 
proper answer. Or, as shown above, we can 
use the conditional array to control 
whether the operation is done or not. In 
the table below, C is the conditional 
array. C controls the function used in 
the first two lines, the order of the 
function's arguments in the next two 


Qcarnrag 
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lines, and whether or not the function is 
performed in the last three lines. 


If C=0 “If C=1 
Return Return By Using This Expression 


A+B A-B A+B x 1*C 
AxB A+B AxB * 1*C 
A-B B-A (A-B)x 1*C 
A+B BtA (A#B)* 1*C 
A A+B A+BxC 

A AxB AxB*C 

A A*B A*B*C 


A more general way to interchange 
scalar function arguments conditionally is 
by reducing a two-row array that has been 
rotated by a conditional vector. For 
example, the following performs A*B if C=0 
and B*eA if C=1: 


A,L13°8,T0.357 © 
Ss 1 7 1 9 2 5 1 2 2 7 3 8 
1 6 1 8 4 4 227 5 6 2 4 4 
0 1 0 1 0 14 0 4 0 14 0 1 40 


(1,p4)p*#COA,[0.5] B 
59 6 7 8 9 16 25 27 32 36 49 64 81 


Booleans can also be used to select 
one of two values via subscription. (Note 
that if the index origin is 0, the Boolean 
values can serve directly as indices, 
obviating the need for the +1.) 


BEU TU TTI 00511709 
7 110B+1] (or 7+Bx4 or 7+B\4) 
OLET 22° 22-i3''7 TLI 77 


This capability allows simple plotting: 


O+FREQ+8?742 
10 24 15 21 35 31 5 18 

' O'C1+FPREQ°.21f /FREQ] 
OO00000000 
OQOOOOD0OOOOCOD0000000000 
OOO0O000R00000000 
QOOUOO00O000000000000 
a aa EA 
OOOO0Oo0ooo000000000 


Another way to use Booleans is to 
control the merge order of two vectors: 


VOWELS+«'AEIOUY' 
CONS+'FCTSL' 

MERGE+1 010310004141 0 
(pVOWELS )=+/~MERGE 


1 
(pCONS)=+/MERGE 
1 
(VOWELS,CONS)CAAMERGE] 
FACETIOUSLY 


MERGE controls the selection of data from 
VOWELS (MERGE=0) or CONS (MERGE=1). 


Booleans can be used to control 
computations in many other ways. Bob 
Smith, known as the "Boolean Bomber" at 
Scientific Time Sharing Corporation, has 
done significant work in this area. His 
paper, "Boolean Functions and Techniques" 
(STSC Working Memorandum No. 106), is 
available for $2.00 and can be obtained by 
returning the attached Reply Form. 


WHIZBANG OF THE MONTH 
by Roy A. Sykes, Jr. 


APL POTPOURRI 


(Note: All examples are in origin 1.) 


Did you know that ... ? 


'thpA selects the first element of A 
aS a scalar. What happens if A is empty? 


3 5+SCALAR is acceptable, and 
creates a matrix with all zeros or blanks 
except for position [1;1] which is the 
same value as SCALAR. 


(Np0)+SCALAR has the same effect as 
(Np1)pSCALAR or (Np1)+SCALAR -- it creates 
a Singleton of rank N. 


A scalar is allowed as the right 
argument to compression and expansion. In 
each case, the scalar is first replicated 
to the appropriate length. Thus, B/SCALAR 
is identical to (+/B)pSCALAR. 


A+1/A changes A into a one-element 
vector if A was a scalar, but leaves all 
other arrays unchanged. 


O=1+0pA returns 1 if A iS numeric, 
0O if A is character. 


A/,B=B=1 returns 1 if B is Boolean 
valued, 0 otherwise. A/,Be0 1 can also 
be used but is somewhat slower. 


You can demote a numeric array R to 
its most compact internal representation 
with the following expression: 


A+pR © L+(x/A)p0 © LL I*R © R+ApL 


Note particularly the third statement, a 
syntactic construction not often seen in 
APL programs. 


Applying reversal ($[K]A) or rotation 
(BOLK]A) along a nonexistent coordinate 
(that is, where ~KeippA) is acceptable and 
has no effect. This can be useful. For 
instance, VL[O[K]¥V] sorts V in ascending 
order if K is 1, and in descending order 
otherwise. 


The expression +/[2] 3 0 4 p5 
returns a 3 by 4 matrix of zeros, thereby 
actually increasing the storage require- 
ments. Can you guess what L/ 2 3 0 p6 
returns, and why? 


Relational scans (<\ <\ =\ 2\ >\ #\) 
do not always return Boolean values. For 
example, 


>. 0.5 0.5.0.5 
0.5 0 1 


Any outer product can be rewritten as 
an inner product. For instance, A°.+B can 
be restated as (((pA),1)p4)0.+(1,pB)pB 
where o can be any scalar dyadic function. 
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Any reduction on numeric data can 
likewise be written as an inner product. 
For instance, x/[K]A can be restated as 
Ox. +(AAKZ1ppA) QA. 


Base value (1) can be written as a 
+,x matrix product. Thus 41B (without 
scalar or unitary coordinate extension on 
the left argument) can be restated as 


(dx\(pA4)4+1,04)+.xB. 


A+A is faster than Ax2, and AxA is 
faster than A*2. Generally the simplest 
function is fastest; but the speed differ- 
ence is small, so use the most natural 
construct. 


The expression (1*p4A)&A selects the 
major diagonal of any array. 


'TO' or 'AO' or 'GMM' can be used 
with DFMT to suppress display of unwanted 
columns. For example: 


'T0,22,A0,74, 3GMM,F8.3' OFMT 4 8p132 
2 4 8.000 
10 12 16.000 
18 20 24.000 
26 28 32.000 


Many experienced APL programmers 
cannot name all APL primitive scalar 
dyadic functions. Can you? (Hint: + is 
one of them, and there are twenty more.) 


The shape of A+B is not all that easy 
to determine. Write a Single statement 
that calculates the result of pA+B in as 
few characters as possible with the 
following constraints: 


1) It works in either origin. 

2) It uses no scalar dyadic functions, 
nor any functions derived from them 
(that is, no reduction, scan, inner or 
Outer product). 

3) It does not use execute (#), assign- 
ment (+), or diamond (4). 

4) It uses no named objects other than 4 
and B. 


Please address all entries to ROY in 
the STSC Mailbox, or contact the author 
using the attached Reply Form. Deadline 
is 18 November 1977. The author of the 
shortest entry will receive a dinner for 
one person if his solution is 57 charac- 
ters or more and a dinner for two persons 
if his solution is less than 57 charac- 
ters. Otherwise stated: 


DINNERS+«1+57>pSOLUTION 
but only if 
(2SOLUTION) A.= pAtB 
for all A and B. 
Results of this contest will appear 


in the January/February 1978 issue of this 
newsletter. 


Support. In this newly created position, 
he is responsible for developing and 
maintaining many STSC software products, 
including QUICKPLAN(TM), ACT*PLUS, and the 
APL*PLUS Securities Database System. 


Spencer joined STSC in 1970 as 
Manager of Marketing in the Washington, 
D.C., office. Since that time he has held 
positions as Manager of Marketing in the 
San Francisco office and APL Applications 
Specialist in Los Angeles. He holds a 
B.S. degree in applied mathematics from 
Clemson University. 


MANAGER OF SYSTEMS SUPPORT TEAM 


Quilla Roth is the new Manager of the 
Systems Support Team. In this position, 
she is responsible for integrating modifi- 
cations, enhancements, and program fixes 
to the APL*PLUS System. 


Roth joined STSC in 1973 as an 
Operator/Programmer. She was promoted to 
Supervisor of Operations in 1974 and to 
Manager of Operations in 1975. She 
graduated from Vassar College with a B.A. 
degree in mathematics. 


MANAGER OF OPERATIONS 


As STSC's new Manager of Operations, 
David Taylor is responsible for the 
reliability and availability of the 
APL*PLUS System. His duties include 
ensuring security in the Computing Center, 
monitoring capacity, and providing batch 
services and system backup. He also 
watches over Computing Center shipping and 
receiving and helps plan and install new 
equipment. 


Taylor came to STSC in 1975 as an 
Operator/Programmer, and was promoted to 
Supervisor of Operations in 1976. He 
attended college in London where he earned 
the equivalent of a B.S.E.E. degree with 
emphasis on computer engineering. 


SUPERVISOR OF OPERATIONS 


Replacing David Taylor as Supervisor 
of Operations is John Sell. In this 
position, Sell oversees the day-to-day 
operations of the Computing Center to 
ensure its smooth running. 


Before Sell joined STSC as an 
Operator/Programmer in 1976, he spent over 
20 years with the U.S. Army. His final 
nine years of service were in the comput- 
erized communications field. Sell holds 
an A.A. degree in science from the State 
University of New York. 


MANAGER OF HIGHER-ORDER LANGUAGE TEAM 


As Manager for STSC's Higher-Order 
Language Development (HOLD) Team, Paula 
O'Donnell coordinates the efforts of the 
group responsible for writing programs in 
higher-order languages such as APL, COBOL, 
and PL/I to handle tasks like the source 
level transfer of APL workspaces, COBOL- 
to-APL file conversions, and billing. 


O'Donnell came to STSC in 1973 as an 
Operator/Programmer. In 1975 she trans- 
ferred to the Technical Support Group and 
waS promoted to Acting Manager for that 
group in late 1976. She became Manager 
for HOLD earlier this year. O'Donnell 
earned an A.A. degree in arts from Dekalb 
College in Atlanta. 


WHIZBANG OF THE MONTH 
by Roy A. Sykes, Jr. 


DATA SELECTION 


Selecting data from an array is one 
of the most common operations in APL. 
There are three primary facilities for APL 
data selection: 


o positional, using subscription (A[B]) 
o conditional, using compression (B/[K]A) 
o bounded, using take (BtA) or drop (B+A) 


There are also several specialized 
techniques for frontal and diagonal 
selection using reshape (BpA) or transpose 
(BXA). Besides selecting data, some of 
these functions can replicate data (sub- 
Scription, compression of a scalar, and 
reshape), extend data (take), or reorgan- 
ize data (Subscription, reshape, and 
transpose). This article discusses these 
techniques aS applied to the matrix A in 
origin.i: 


OL0<«1 -9 O+A+(10x15)¢.+19 


Subscription is the most generalized 
selection function. It can be used to 
select, reorder, and replicate data. 


AC2513;:9157 5 5] 
29 24:25 -27.25 25 
59 51 55 57 55 55 
19 1445 47-425 415 


Because of its flexibility, subscrip- 
tion is typically the slowest of all 
selection functions applied to a matrix. 
However, it alone can reorder or selec- 
tively replicate data, and it can be used 
to the left of assignment (+). In the 
absence of these requirements, subscrip- 
tion is best used only to select single 
rows Or columns, or small subarrays. 
Subscription is somewhat faster on vectors 
than it iS on matrices. 


Compression is less flexible than 
subscription; it can select only along a 
Single coordinate, and it cannot replicate 
Or reorder data. However, it can select 
non-contiguous rows or columns in the same 
Order they occur in the array. 


11001 FA 
1I" i2 '137I5 15- 10 17 10-19 
21722 793 24-25-26 27 20 79 
51/32 53 54 55 56 57 50-99 


1 0001010 1 /A 
ti°25 17-193 
21 25 27 29 
31 35 37 39 
41 45 47 49 
ot So 57 59 


Because of its more limited capabil- 
ity, compression is typically faster than 
subscription. Of course, it is ideal when 
selecting on the basis of a Boolean vector 
from epsilon (e) or one of the relational 
(< < = 2 > z) or logical (Av * ¥ ~) 
functions. As noted, compression can 
select along only one coordinate. Thus 
the matrix must be compressed twice in 
order to select both rows and columns. 


011007000111100. 7/A 
24 25 26 27 
34 35 36 37 


If indices are already provided for 
one coordinate, it is usually faster to 
use B/ıpB and subscription. For instance, 


ACB/i1pB; 5 4 7 6] 
should be used rather than 


BIAL; 5 4 7 6] 


Take and drop are the most restric- 
tive selection functions. They allow only 
contiguous blocks of data to be selected, 
and they cannot replicate or reorder. 

They are quite fast, however, at selecting 
large blocks of data. One application of 
either function is sufficient to select 
any corner subarray. 


3 54A a 2 44A 3  5tA a 2 444A 
Peia ta 04 £5 15 16 17 18 19 
21 22 23 24 25 25 26 27 28 29 
31 32 33 34 35 35 36 37 38 39 

“8 StA a2 UHA ~3 54A A 2 444 
31 32 33 34 35 35 36 37 38 39 
41 42 43 44 45 45 46 47 48 49 
51 52 53 54 55 55 56 57 58 59 


Any contiguous block of data can be 
selected by two applications of either 
function. The subarray 


AL2°3~3 u S'6 33 
24 25 26°27 
34 35 36 37 


can be selected by each of 16 expressions: 


2 4 4+4 3 7 +A 2 ut 3 6 +A 
"o "e +3 Ta 42 “2 44 2 3 4A 
1 3+ 3 7 4A 1 2 +4 3 6 +A 
MS S: Ral ao ee T 4 “2 E e iir “+A 
2: RPF “ETT HA 2 4+ 4 6 FA 
Ppp |G "2 44 2 4ut 1 3 $A 
Ta OR Ty lg eg i ae a Si ee 
“2 34+ 1 2 +A “pe “9 eo gg FA 
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Unless extension (padding with zeros or 
blanks) is needed, convenience should 
determine whether to use take or drop. 


Reshape and transpose are normally 
considered structural functions, but are 
often used for specialized selection. 
Reshape can select data from the front ọf 
an array. For instance, the following 
expression selects the first element as a 
scalar: 


"tod 
11 


The next expression selects the first row 
as a vector: 


SpA 
11- 32° 23°35: 35°35 27°23 a2 


Transpose can select the major 
diagonal, or the other diagonals if used 
with reversal. 


1 1 QA 
14°22 33°88" 55 
1 1 QdoA 
19 28 37 46 55 
1 1 eA 
Si 42 33 24 15 
1 1 QgeA 
59 48 37 26 15 


Both reshape and transpose are very fast. 


In general, the speed of all selec- 
tion functions depends on the amount of 
data being selected, not on the size of 
the original array. As more data is 
selected, more CRUS will be consumed. By 
avoiding superfluous data selection and by 
using the most specialized function appro- 
priate to your needs, you can increase the 
efficiency of your programs. 


CONTEST REMINDER 


Results of the contest published in 
the September/October Issue of NEWS will 
be announced in the January/February 1978 
issue -- look for them! 


CORRECTION 


Please accept our apologies for our 
rank error on page 6 of the "Whizbang" in 
the September/October 1977 issue: 


The expression to demote a numeric array 
should be 


A«+pR © L+(x/A)p0 © LL I+,R © R+ApL 


Eliding ravel (,) causes an error for non- 
Singleton arguments of rank 2 or higher. 
(Thanks to Georges Brigham, Bob Brown, and 
George Cherlin for catching this.) 


services, and universities of North 
America and Europe attended the sessions. 
Representing STSC were Philip Abrams, John 
Gilmore, Don Link, John Myrna, Bob Smith, 
and Roy Sykes. 


Numerous topics of continuing inter- 
est to the APL community were discussed at 
the workshop, including the transfer of 
APL workspaces at the source level, 
general arrays, standardization of APL 
implementations, extensions to APL, and 
secure packages written in APL. Asa 
result of the talks, STAPL members agreed 
to publish and support a standard for 
workspace interchange and to recommend an 
APL notation for handling complex numbers. 


Detailed reports of the meeting are 
printed in the December 1977 issue of the 
STAPL journal, APL Quote Quad. Proposed 
Standards will also be published in upcom- 
ing issues of Quote Quad, as will several 
of the informal papers presented at 
Minnowbrook. For information on obtaining 
copies of Quote Quad or joining STAPL, 
write to Marilyn J. Pritchard, Scientific 
Time Sharing Corporation, 7316 Wisconsin 
Avenue, Bethesda, Maryland 20014. 


MYRNA SPEAKS TO COMMUNICATIONS MANAGERS 


John Myrna, former Manager of Commu- 
nications and now Director of Development 
at STSC, was a guest speaker at the Data 
Communications Forum in Arlington, Vir- 
ginia, in December. The Forum was spon- 
sored by the Institute for Professional 
Education, Inc. More than 50 data commu- 
nications managers from industry and 
government attended. 


In his talk, "Why You Should Consider 
Packet Carriers", Myrna focused on the 
cost-effective use of packet communica- 
tions based on STSC's experience. He 
discussed the major factors in evaluating 
and selecting effective data communica- 
tions services: cost, quality (reliabil- 
ity, availability, and response time), 
scope, and image of service. 


Anyone wishing to obtain a copy of 
Myrna's paper should contact him at 
Scientific Time Sharing Corporation, 7316 
Wisconsin Avenue, Bethesda, Maryland 
20014, (301) 657-8220. 


JUST OFF THE PRESS 


Publications for two major STSC 
applications were completed in December: 
the FPS User's Guide -- Part 1 and the 
EMMA User's Guide. Brief descriptions of 
these and the revised edition of the 


Phones-Sign-On Card are given below. 
FPS USER'S GUIDE -- PART 1 


Part 1 of the Financial Planning 
System (FPS) User's Guide provides an 
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overview of FPS procedures and shows how a 
simple financial planning scenario can be 
modeled and reports produced. Part 2 will 
describe the basic features in the system, 
and Part 3 will describe the advanced 
features. Parts 2 and 3 are currently 
under preparation and will be available 
this spring. (Part. 41, 62:00) 


EMMA USER'S GUIDE 


The EMMA User's Guide introduces EMMA 
to application designers, programmers, and 
other users of APL*PLUS Service. EMMA is 
a powerful set of tools for managing large 
amounts of data in an APL*PLUS System 
file. Brainchild of Roy Sykes, STSC's 
Master Whizbanger, EMMA can be used to 
implement a wide variety of designs for 
database management applications. The new 
manual covers the major aspects of cre- 
ating, modifying, and changing an EMMA 
file. Using a sample database, the manual 
demonstrates the most commonly used EMMA 
utility functions. More detailed techni- 
cal information and descriptions of all 
utility functions will appear in the EMMA 
Reference Manual, to be published in 


February. ($5.00) 


PHONES/SIGN-ON CARD (NOVEMBER 1977) 


This card provides a combined list of 
APL*PLUS Service access numbers for all 
Supported networks, systems, terminals, 
and speeds. It also contains sign-on 
procedures for the networks and a list of 
messages that may be displayed at the 
user's terminal during sign-on. The 
November edition supersedes the April 1977 
edition of the card. (N/C) 


by Roy A. Sykes, Jr. 


WHY NOT TO LOOP 


Most APL systems are implemented as 
interpreters. This allows users great 
flexibility in creating and modifying 
programs. There is no need for compila- 
tion, and errors can be diagnosed and 
corrected during execution. 


The major penalty of an interpreter 
is the cost of syntax analysis and dynamic 
data checking. Fortunately, APL functions 
tend to be quite short relative to pro- 
grams in other languages, and therefore 
the interpretive overhead is minimized. 


On the other hand, when looping is 
performed in APL, the interpretive over- 
head skyrockets because the same code must 
be reanalyzed for each iteration. The 
general rule, therefore, is to try to 
minimize looping in APL. 


As anyone familiar with APL will 
attest, most programs can be written with 
no loops at all. APL's rich set of 
primitive functions and operators allows 


"closed form" (nonlooping) solutions to 
many complex problems. The control 
Structures that must be written explicitly 
in other languages (e.g., DO loops) are 
performed implicitly in APL primitives 
(e.g., Scan). But difficulties can arise 
when people who have been schooled in 
languages like FORTRAN, BASIC, and COBOL 
try to move their looping programs intact 
into APL. 
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For example, the following portion of 
a FORTRAN program calculates cash flow for 
a series of 360 investments using variable 
interest rates. DEP is N+l deposits, the 
first being the initial amount; RATE is N 
interest rates between 0 and 1; CF is the 
result. 


DIMENSION CF (361) 
DIMENSION DEP (361) 
DIMENSION RATE (360) 


CF (1)=DEP(1) 
DO 20 I=1,360 
20 CF (I+1)=DEP(I+1)+(CF(1I) *RATE(I)+1.) 


This segment can be translated almost 
directly into the following APL program: 


V CF+RATE CASHFLOW DEP;I;N 
[1] CF+DEP 
[2] N+pRATE 
[3] +N+¥0 © I+1 
C4] L20:CFCI+1)+DEPCLI+1)]+CFPCIIJxRATELIJ+1 
[5] +L20xN2>I<«I+1 
y 


and will run successfully, albeit slowly: 


(pDEP) ,pRATE a 30-YEAR MONTHLY LOAN 
361 360 

T+QAI © CF+RATE CASHFLOW DEP © DAI-T 
0 2070 483 O (0,MILLICRUS ,MILLISECONDS ,0) 


More than 2 computer resource units are 
required to run CASHFLOW -- not a very 
cost-effective solution! 


Rather than entirely rethinking the 
problem, some users will take great pains 
to optimize the existing algorithm. This 
approach is fine if no alternative algo- 
rithms are considered, but it often 
results in a more obscure program: 


V CPER CASHFLO2 DsIsd3N3T 

C1] T+D([I+1] O D*1+CF+«D 

L2] R+R+1 

C3] +NV+((pR)pL),0 

C4] L:T+CFldJ+I+1J)*+DCII1+7xkR(I) © -NLI<J7] 
y 


The techniques used in CASHFLO2 all 
serve to reduce interpretive overhead in 
the loop on line [4]. Interest rates R 
and branch targets W are calculated in 
advance, and subscript calculations are 
Simplified. (Note that the use of one- 
letter variable names does not affect the 
cost of running the program.) 
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T+OAI © CF*RATE CASHFLO2 DEP © DAI-T 
0 1301 330 0 
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We've reduced the cost by one third -- a 
Significant savings, but still not enough 
to be cost effective. 


By far the most effective approach is 
to reformulate the problem into an APL 
solution, rather than a warmed-over 
FORTRAN solution. Consider the following: 


V CF*RATE FASTFLOW DEP . 
C1] CF+1,x\RATE+1 © CF*CFx+\DEPtCF 
v 


CF is set first to the cumulative present- 
value discount factors for future depos- 
its. The deposits are divided by those 
factors, summed, and then multiplied by 
the factors to produce the result. A good 
exercise is to derive this algorithm from 
the looping algorithm; the key is that 
multiplication and division are both 
distributive with respect to addition 
(division is only right distributive). 


FASTFLOW is indeed an elegant and 
inexpensive solution to the problem: 


T+UAI © CF*+RATE FASTFLOW DEP © QAI-T 
0 26 7 0 


It performs 50 times faster than even the 
optimized looping solution, and illus- 
trates the benefits that can be derived 
from writing nonlooping APL code. 


CONTEST RESULTS 


The pA+B contest appearing in the 
September/October 1977 issue of NEWS 
prompted a wide variety of imaginative 
responses ranging from 10 to 84 charac- 
ters. Many did not work, apparently 
because the conformability requirements 
for scalar dyadic functions were not 
clearly understood. A+B will work if 


(1A.=pA) v 
(Taye@eB) v 
((ppA)=ppB)A(634+pA)A.=634pB 


That is, no LENGTH or RANK errors will 
occur if either argument to a scalar 
function is a singleton (a one-element 
array of any rank). Did your solution 
work for ‘2°33 #°+-1't ‘97? F? If not; you 
were probably assuming that the shape of 
the result was that of the argument having 
higher rank -- not always true! The rule 
is 


(pA+B) IS (pB) 
IF (1#x/pB) 
OR (1=x/pB)A(1=x/pA)A(ppB)>ppA 
ELSE (pA) 


Martin Gardner (STSC) and Paul 
Penfield (MIT) said it best: 


((~1e€p,B)/pB) ,(1€p,B)/(pA),(1€p,4)/(ppA)+pB 
We're pleased to award them each a dinner 


for two for their additional contributions 
to the art of APL! 


ABRAMS ADDRESSES COMPUTING COMMUNITY 
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ON ADVANTAGES OF APL 


Philip Abrams, STSC Vice President - 
Operations, spoke at the annual ACM 
Computer Science Conference held in 
February in Detroit. ACM (Association for 
Computing Machinery) is the national 
organization of computer science profes- 
sionals. As one of three invited speak- 
ers, Abrams shared the lectern with John 
Backus, the principal inventor of FORTRAN, 
and Anthony Hearn, an authority on artifi- 
cial intelligence. 


With characteristic wit, Abrams 
titled his lecture "APL Is Not for Every- 
one". Although the talk examined APL 
critically, it did so in such a way that 
the language's many strengths and virtues 
became readily apparent. By this indirect 
approach Abrams believes he was able to 
persuade many computer scientists in the 
audience to recognize the advantages of 
uSing and teaching APL. 


Students in the Principles of APL 
Programming class at North Carolina State 
University in Raleigh recently enjoyed a 
lecture given by Janet Faltz, STSC South- 
east Branch Manager. 


For many of the students, this was 
their first exposure to using APL as a 
problem-solving tool outside an academic 
environment. Features of the APL*PLUS 
System that set STSC's APL system apart 
from those of other APL and general time 
sharing vendors were presented. Factors 
that contribute to STSC's high level of 
service, including reliability, security, 
and personal expertise, were also dis- 
cussed. After the lecture, students were 
able to ask questions about how they could 
best use APL in their field of interest 
once they completed their studies. 


NEW UTILITY WORKSPACE PROVIDES TOOLS 
FOR TIME AND DATE CALCULATIONS 
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The new workspace 6 DATES on the 
APL*PLUS System contains a complete set of 
utility functions for manipulating dates 
and times. Included are functions that 


o convert numeric dates and timestamps 
to various formats 

calculate differences between dates 

compute days of week for given dates 
convert between time zones 

verify dates and leap years. 
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The examples below were run on 
23 January 1978 to illustrate the use of 
several functions in the workspace. 


What is the current Eastern Standard 
Time? 


O«+TIME*<StO7TS 
1978 1 23 20 2 
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Spell out the current time in two 
different formats: 


1 DATESPELL TIME 
JAN 23, 1978 8:02 PM 
7 DATESPELL TIME 
MONDAY, JANUARY 23, 1978 8:02 PM 


What time is it in Los Angeles and 
London? 


7 3 DATESPELL TIME 

MONDAY, JANUARY 23, 1978 5:02 PM 
7 5 DATESPELL TIME 

TUESDAY, JANUARY 24, 1978 1:02 AM 


On what day of the week was Abraham 
Lincoln born? 


DAYSCLDAYOFWK 1809 2 123] 
SUNDAY 


How many days are left until tax 
returns are due? 


78 4 15 DAYSDIFF 3+tUTS 
82 


What is the timestamp of workspace 
6 DATES? 


UWSTS 
3.385395388F10 


Convert the workspace timestamp to a 
numeric vector representation, and format 
the workspace timestamp: 


FPTIMEREP WWSTS 
1978 116 11 16 4 733 

FTIMEFMT [(WSTS 
1/16/76 TT 1670F, 733 


The functions in workspace 6 DATES 
supersede similar functions in workspaces 
1 UTILITY and 1 FILEAID, and most execute 
considerably faster. For more informa- 
tion, see DESCRIBE in workspace 6 DATES. 


WHIZBANG OF THE MONTH 


by Roy A. Sykes, Jr. 


CHARACTER SEARCHES 


With the recent introduction of the 
system function DSS for character string 
searching, and the major speedups to 
character ^a.= and v.#, a variety of new 
techniques are now practical. Although 
these techniques were usable before the 
enhancements, they were often infeasible 
because of WS FULL conditions or the large 
number of computer resource units needed 
to accomplish them. Frequently, obscure 
or clumsy code was written to circumvent 
the previous snailish pace of a.=. 


For example, a typical table lookup 
MATRIXA.=VECTOR 


was occasionally coded as 


A/MATRIX=(pMATRIX) pVECTOR 


to reduce cost (at the risk of WS FULL). 
Now, the first expression is many times 
faster. 


The improvements to a.= also dramati- 
cally altered the relative speeds of the 
six different string-searching algorithms 
presented in the March/April 1977 issue of 
NEWS. In particular, SS2, SS3, and SS4 
all use a.= for part or all of the search. 
Below is a table comparing their perform- 
ance on 25 February 1977 with their 
current performance and with that of their 
Oss-equivalent (R<«(A USS B)/ipA). The 
times are in milliseconds. 


B oR 25 FEB 1977 24 FEB 1978 

SUBSTRING HITS S52 SS3 SS4 S52 S53 SS4 OSS 
> 3683 19 2 2 8 2 2 6 
no 783 60 8 8 1a 8 8 8 
Ta" 0 56 2 2 6 2 2 2 
i ' 342 105 27 28 10 18 19 pi 
"AN! 33 104 10 9 v; 8 8 = 
‘aw' 0. 103 3 4 6 3 4 2 
! PHE" 20 204 55 9 8 24 7 5 
"THE * 16 203 14 9 fi 9 7 3 
"MATRIX * 5 300 15 5 8 9 6 2 
'RATMIX' 0 201 29 5 8 ia 6 2 
* PERMITS" 15 403 113 17 10 41 10 5 
"PERMITS !' 15 401 16 t7 8 9 9 2 
'NOTFOUND.! 0 399 22 19 8 10 zo 2 
"ae1OUup~w' 0 399 3 4 8 3 5 1 


(Executed 24 Feb 78 at 6:46 PM EST 
on APLPLUSC/470-10011 with 34 users) 


Although they still suffer a propensity 
for WS FULLS with large arguments, it's 
nice to know their cost has dropped so 

dramatically! 


Pleasantly, the discussion of string- 
search techniques is now moot because we 
have OSS. In fact, even if DSS weren't 
available, the following simple function 
would perform the same task: 


V A+A ASS B 
C1] R+pA © R+RpBA.=((p,8) ,R+xRh) pA 
V 


The function ASS takes roughly twice as 
long as QSS to perform a given search. 
Moreover, DSS has the advantage of 
requiring no additional or intermediate 
workspace storage. 


Not only does SS provide cost and 
Storage advantages, but it also allows 
novel or simpler approaches to problems. 
Below are a few examples. 


Determine if function DSPELL is currently 
suspended or pendent; that is, whether 
it's in the state indicator. 


FN+'DSPELL' 
OLD v/(((1+p0SI),1+p,FN)tOSI)A.=PN,'[' 
v#í((3tp0SI) 740i JA.="DSPELLL * 
NEW v/(,' *,O8Z)0Ss * ',Fw,'Ct 
v/(,* *, DST MISS -t Sree 


mH OQ D> to 


y 


AnuPrnnawavrs 
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Mark the line-ending carriage returns in a 
function. 


VR+-DVR 'FNNAME' © VRCOIO-6-pVRI<'LC' 
OLD (VR=OTCNL)A1OVR='C! 
NEW VF 09s [TORLI 


And finally, here's a function that 
locates a string in any row of a matrix: 


V R«+M ROWSS V 
Ba Rev /C0,1-p,V)4( pM) 0(,M) OSS V 
y 


Oss locates all occurrences in the rav- 
elled matrix, and its result is reshaped 
to the shape of the matrix. Then, those 
columns that might identify matches 
spanning more than one row are dropped. 
Finally, the v/ singles out the proper 
rowS. ROWSS will locate matches in a 
character matrix: 


pSTATES 
50 14 
(STATES ROWSS 'NIA')#STATES 
CALIFORNIA 
PENNSYLVANIA 
VIRGINIA 
WEST VIRGINIA 
(STATES ROWSS 'LIN')#STATES 
ILLINOIS 
NORTH CAROLINA 
SOUTH CAROLINA 


Readers with interesting uses for DSS 
should contact the author of this article 
by returning the attached Reply Form or by 
sending a message to ROY in the STSC 
Mailbox. The most imaginative uses will 
be published in a future issue of this 
newsletter. 


WHIZBANG CORRECTION 


Gary Gordon (Comsat) pointed out that 
the relics of my FORTRAN coding ability 
apparently do not include function hier- 
archy. I do not regret that fact, but I 
include the following correction in the 
interest of journalistic accuracy. The 
last line of the FORTRAN program should be 


20 CF (1I+1)=DEP (I+1)+CF (1) * (RATE (I) +1.) 


(Remember, you're supposed to be convert- 
ing from FORTRAN to APL!) 


SOLUTION TO PUZZLE ON PAGE 3 


ee 


Remember? Now imagine trying to con- 
nect 100 dots without the help of ARIES. 


32 ACTDEMO 
Revised demonstration of some actu- 
arial applications in the ACT*PLUS 
System (Library 32). 


99 FCGENFNS 
(File conversion generator.) Con- 
verts data from an external medium to 
APL*PLUS System files. 


99 WSTRANSFNS 
Heavily commented functions to aid 
the user in developing a source level 
transfer system. 


DELETED WORKSPACES 


99 FILETRANSFER 

Superseded by 99 SLT. 
99 WSTRANSFER 

Superseded by 99 SLT. 
32 ACTHOW 

Superseded by 32 ACTDEMO. 
32 ACT1 

Superseded by 32 ACTDEMO., 
32 ACT2A 

Superseded by 32 ACTDEMO. 
32 ACT2B 

Superseded by 32 ACTDEMO. 


NEW BYTE-SIZE BOOLEAN SCALARS 
ON APL*PLUS SYSTEM 


In March the internal representation 
of Boolean scalar constants on the 
APL*PLUS System was altered to reduce the 
storage required. Each Boolean scalar 
constant in a defined function now 
requires only 1 byte of storage instead of 
the 4 bytes previously required. For 
example, the function below contains five 
Boolean scalars; the workspace storage 
required for such a function is now 15 
bytes less than before. 


V R+INCR A 

[1] +(0#1+0pA)/ERR 

[2] R+At+1 70 

[3] ERR: ‘ARGUMENT MUST BE NUMERIC' 
y 


More details on how users are 
affected by this change are provided in 
item 361 in workspace 1 NEWS on the 
APL*PLUS System. 


WHIZBANG OF THE MONTH 
by Roy A. Sykes, Jr. 
BACK TO BASICS 


One of the beauties of APL is the 
ability to get useful results while using 
only a small subset of the language. 
Gradually, through curiosity and experi- 
mentation, users discover more advanced 
features which they can apply to their 
problems. In the interest of hastening 
that discovery, I'd like to introduce you 
to a powerful function in APL called base 
value. I'll discuss its performance on 
scalars and vectors only, leaving you to 
experiment with matrices if you wish. 
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Base value, also called decode, is 
represented by the 1 symbol (shift B). It 
is dyadic and accepts only numeric argu- 
ments. Let's see what happens in a simple 


case. 


101 8 29 5 
8295 


The effect of "putting numbers: together" 
1s actually the result of evaluating the 
polynomial expression 

8x* + 2x? + 9x + 5 


for x=10, or, for you APL'ers out there: 


X+10 
(8xX*3)+(2xX*2)+(9xX)+5 
8295 
Or 
5+Xx9+Xx2+XxB 
8295 


Notice that the coefficients are on the 
right and the independent variable is on 
the left. Thus, base value evaluates 
Olynomials. This may seem esoteric, but 
it actually proves quite useful. For 
example, you can use it to combine month, 
day, and year into one number, 


1001 6 5 77 A (10000x6)+(100x5 )+1x77 
60577 


Or to determine the number of seconds in 2 
hours, 10 minutes, and 38 seconds. 


601 2 10 38 wm (3600x2)+(60x10)+1x38 
7838 


Note that the following three expres- 
sions are equal: 


2+3x4 © 341 4 20 4L 3 2 
14 
14 
14 


From what you've seen so far, can you 
explain why 11VEC is the same as +/VEC; 
and why 01VEC is the same as ~1+*VEC? 


For scalars and vectors, A1iB is 
equivalent to +/WxB (or W+.xB), where W is 
a vector of weights derived from A. As 
we've seen above, if the left argument is 
a scalar, then (assuming DI0<+1) W is 
calculated as follows: 


W+- Ax(pB)-1pB 
60* 3 “4 2-3 
60* 2 1 0 
3600 60 1 


More generally, if the left argument is a 
Vector, 


A+ 0 7 24 60 60 1000 
Be 123 4 5 6 
A1iB 

788645006 


base value is evaluating a mixed radix 


polynomial. In the example above we're 
calculating the number of milliseconds in 
l week, 2 days, 3 hours, 4 minutes, 

5 seconds, and 6 milliseconds. In this 
case the left argument represents: 


0 (placeholder) 
7 days per week 
24 hours per day 
60 minutes per hour 
60 seconds per minute 
1000 milliseconds per second 


The weighting vector is computed by 
performing a multiplication scan from 
right-to-left on all but the first element 
of the left argument, and catenating a i 
to that result: 


W+O1,x\O14A 

W+o1,x\O140 7 24 60 60 1000 

W+O1,x\O7 24 60 60 1000 

W+>1,x\1000 60 60 24 7 

W+O1,1000 60000 3600000 86400000 604800000 

W+O1 1000 60000 3600000 86400000 604800000 

W+604800000 86400000 3600000 60000 1000 1 
week day hour min. sec. ms 
(milliseconds in) 


Hence we multiply the number of milli- 
seconds in a week by the number of weeks, 
add it to the number of milliseconds in a 
day multiplied by the number of days, and 
so on, to produce the result: 


+/WxB 
788645006 


If either argument is scalar, it is 
extended to the length of the other 
argument, and the same weighting vector 
calculation is used. Thus the following 
expressions are equivalent: 


32 342 32 4 3 


314 
22 12 29 è 2,242 
314 
2332 3 
314 
60 ı 10 0 120 
36120 
60 60 60 1 10 0 120 
36120 
0 60 60 1 10 O 120 
36120 
7743 60 60 ı 10 0 120 
36120 


As illustrated, the first element of 
the left argument does not affect the 
result, but serves only as a placeholder. 
If either argument is not a scalar or one- 
element vector, the lengths of the argu- 
ments must match. 


60 60110 0 120 
LENGTH ERROR 
60 60 1 10 O 120 
A 


This rule is identical to that used by 
inner product (for example, +.x), although 
in inner product the first element of the 
left argument does affect the result. 


Mm OQ DP 


>) 
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SOME USEFUL APPLICATIONS 
Right justify a character vector V: 
(1-(V=" ")11) OV 


Actually, this expression will right 
justify an array of any rank. It uses the 
"reverse multiplication scan" weighting 
vector calculation as an and-scan (A\), 
and the +/Wx1i to calculate (i+) the number 
of contiguous ending spaces. 


The Whizbang in the January/February 
1978 issue of NEWS contained the function 
FASTFLOW to compute a cash flow series for 
a starting balance (SB) and N subsequent 
deposits (DP), all compounded at varying 
interest rates (IR). 


V CF*+RATE FASTFLOW DEP 
C1] CF+1,x\RATE+1 © CF*CFx+\DEP+tCF 
y 
SB+ 100 
DP+ 0 50 100 216 0 123.26 
IR+ 0.2 0.25 0.08 O 0.05 0.06 
IR FASTFLOW SB,DP 
100 120 200 316 100 105 234.56 


If you don't need the intermediate values 
and only wish to know the ending balance, 
the following expression will suffice: 


(0,Z2R+1)1S5B,DP 
234,56 


Given an array YM containing months 
packed as YYMM or YYYYMM, add A months and 
return the dates in the same format: 


V R+A MONTHADD YM 
[1] R+1+100110000 12TA+12110000 100TYM-1 
y 
7 MONTHADD 197805 7806 197912 
197812 7901 198007 
~5 11 16 MONTHADD 197804 
197711 197705 197612 
(16) MONTHADD 197806+16 
197808 197810 197812 197902 197904 197906 


This is a lovely illustration of the 
relationship between decode (1) and encode 
(tT). It uses both to convert back and 
forth between decimal and base-12 repre- 
sentation. 


WHIZBANG CORRECTION 


The function ASS, as published in 
Whizbang in the March/April 1978 issue, 
emulates OSS most, but not all, of the 
time. It fails when the right argument 
matches the last and first few elements of 
the left argument: 


"UGH, HERE''S THE RUB' ASS 'BUG' 
0Oo00000000000000000 1 


The following version corrects this 
overlap problem: 


V RA ASS B;C 


[1] R+pA © C+p,B © R+Rt(-C)+BA.=(C,R+xR) pA 


V 
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Keeps Customers Up-to-Date 
On Shipping Services 
by Jean McLanahan 


Editor’s Note: The main objective of the Operations 
Department at Scientific Time Sharing Corporation 
is to provide customers with the best possible 
service. An important part of that service is shipping 
and receiving customer data. Jean McLanahan, 
Senior Coordinator of the Services Support Group, 
investigates different shipping services to provide 
customers with the most reliable service. 2 

- Beginning with this issue, Jean outlines, in a 
series of five articles, the various shipping services 
available to customers. In this first article of the 


series, general shipping procedures are described 


and Federal Express service is evaluated. 


The Services Support Coordinators in STSC’s 


Operations Department are responsible for ship- 
ping and receiving customer data such as fileprints, 
punched cards, and magnetic tapes. Coordinators 
are staffed on two shifts to assure continuous and 
reliable service. , 

The First and Second Shift Coordinators, Mary 
Edith Kovach and Valerie Quinn, prepare the ma- 
terials for shipment. Frequently, packages and 
paperwork are prepared in just minutes so that 
pickup deadlines can be met. Unless instructed 
otherwise, shipping charges are. prepaid by STSC 
and added to the customer's batch services invoice. 

Periodically, the various shipping services are 


compared with respect to their rates, pickup and | g 
delivery times, limitations, insurance, tracking abil- 


ity, and reliability. This information is especially 
helpful In the event of short notice deadlines. 
-»<The paragraphs that follow describe and eval- 
uate Federal Express service. Forthcoming issues 
will cover Emery Air Freight, bus services, airline 
“small-package” services, and Central Delivery. 
Methods of Shipment | | 


Priority One — Next business day delivery in. 
most U.S. cities and in selected Canadian cities. _ 
Saturday delivery is available for an additional | 
$6.00 per package, if recipient is in a Federal. — 

DMA ES ASE @yist?” This article discusses some techniques for 


ete 


Express service area. 


~ Standard Air Service — Second business day _. 


delivery in most U.S. cities and in selected Cana- matr gba 
(ae) The most fundamental operation isan equality 


_ comparison between two scalars: | 


dian cities. No Saturday delivery. “a 


=S Courier Pak - Next business day delivery ‘in | 
most U.S. cities and in selected Canadian cities. . 


Shipments must not exceed two pounds and have a 
flat rate of $14.00. Saturday delivery is available for 
an additional $6.00 per package, if recipient is ina 
Federal Express service area. This method of ship- 


ment is best for letters and small fileprints. We don't | 


Future articles in News will keep you informed 
on other aspects of the conference and provide 


Information — gg a a recommend it for magnetic tapes because the 
shipping envelopes are not padded and, therefore, 


~ do not afford adequate protection for tapes. 


Rates ee 


Reasonable for overnight ‘freight service 


STSC Services Support Group | Š ; 


T Competitive with other overnight freight services. 


Insurance 


The first $100 of insurance is free with all three 
methods of shipment. Additional insurance can be 
purchased for $0.25 per $100 or fraction thereof. 
Priority One and Standard Air Service packages can 
be insured for a maximum of $2000. Courier Pak 
packages can be insured for a maximum of $200. 
We don’t recommend purchasing additional insur- 
ance, except when communications equipment or 


terminals are being shipped. 


Limitations ` 


< Priority One and Standard Air Service pack- 
ages must not exceed 70 pounds; Courier Pak 
packages must not exceed 2 pounds. 


- Tracking Lost Shipments 


-~ Very good. Tracking is prompt, and progress 


reports are provided at regular intervals until the 
problem is resolved. — 
Reliability See ee Pe 

Excellent. We recommend it highly. 

For more information on Federal Express ser- 
vice, contact your local Federal Express office 
(listed in the Yellow Pages under “Air Cargo”) or call 
the Federal Express toll-free number 800-238-5355 
(in Tennessee, 800-542-5171). 


~~ Acommon problem in any computer language 
-is determining if an object exists in a group of 
‘objects. For example, “Is 1403 one of our customer 


~ numbers?” Or, “Does part number A4-103478-Y 


searching collections of objects (e.g., vectors and 
matrices) for possible member objects. 
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If we extend either argument to a vector, we get a 


corresponding Boolean vector indicating whether 
or not the elements of the arguments match: 


News July/August 1978 5 


1403= 1374 1413 1403. 1304 982 2605 
0.) 4,.0-0-4 


By performing an Bereductton RA: on such a 
Boolean vector, we ask the question “Is it ever true 
that...?” or “Is there any case where.. o- 


. 


Conversely, and- reduction (fa answers itt “Ghee ; 
.?” or Is it it aava ye ne, D 


tion “Is it ate true that . 
case that. ..?” 


0 i 
by using the membership function (€): » 


1403c 1304 1413 1403 1374 982 2605 
4 | 


The above statement can be read as “Is 1403 an : 


element (member) of the right argument?” 
Membership also handles multiple elements 


on the left and returns a conesponding. 5001000 ae 


array: 
982 760 1304 € 944 1304 8 982 622 26 
1: 0.4 | 


Thus, the statement a/,AcB answers thes ques: 


tion “Is every element of A contained in B?”. 

We can extend the original case of 
V/SCALAR=VECTOR to arbitrary UE of. any 
rank by using outer product (°. 5): | 


V/Ac.=,B 


The above statement is functionally identical to 
AeB. Membership is typically more efficient, how- 
ever, because it can stop searching for a given 
element once it has located the elementin the right 
argument. Outer product must compare all ele- 
ments against each other. 


A list of customer numbers is usually stored as ~ 


a vector wherein each element (a scalar) represents 
a single customer number, membership treats this 
case directly. Suppose, however, that the objects 
that we’re searching for are not scalars, but vectors. 
For example, a list of names is usually stored as a 
matrix wherein each row (a vector) represents a 
single name: : 


oNAMES © NAMES 
6-9 
HAGENBUCH 
CATELAIN 
GRAZULIS 
SMITH 
MATSUMUNE 
GARDNER 


If we use membership to determine whether 
the name CLAUSEN appears in the list, we'll find 
thatit doesn’t work since each element of the result 
ismarked 1 ifits corresponding elementin the left 
argument is found AAA in ue o gga: argumeni 


'CLA USEN! <WAMES 
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“v/1403= 1304 1413 1403 1374 982. 2605 ae 


A/1403= 1304 1413 1403 1374 982 2605 


set is eae ee a Pras eee sa 
to determine mii any names alch ine right 


What we really want to dois compare the elements 
of 3+'CLAUSEN' with. the corresponding ele- 


ments of each row and perform an and- reduction to 
assure that all positions are matched; an ee 


ea 


Ler 14 p NAMES 


Ş 
gY 


Gee 


Br PR t ; 
A a 
i n $ 
a 


A more rect and efficient way to ua thisi is stou use an is 
- ^. = inner product: ETIE Ne A ee 
The or-reduction case is more directly coded ie ee 


om pee 


TE eravam 


0-0 0- 0 00 


NAMESK. SLAT! 'GRAZULIS* 


0 01000 a: : Ss 


Thus we get one element for sen row in the left | 


argument, and we can enter | 
- V/NAMESA. =! HAGENBUCH* 


argument (which must have the same number of 
elements as the left argument has columns). . 
What if we want to search for several names? 


oSEV © SEV~ 
GRAZULIS | = DERS 
SUWARA a a 
MATSUMUNE 

NAMESA.=SEV. 
LENGTH ERROR: ue 

NAMESA.=SEV = : 

bon. NEER he Pate ge atin te ee 


We find that, even though both arguments have 
nine columns, we’ve SOMenOW GOOLSS. Gor AA. =p 
the rule is Se Sale 5 


AERTS. 
16€CC,=/CC ; 


That is, the last coordinate of the left argument and 
the first coordinate of the right argument (the com- 
bining coordinates) must be equal, or either may be 
1, or either argument may be a scalar. The shape 
of the resultis (~1+pA),1+pB (i-e., after the two 
combining coordinates are checked for conform- 
ability, they are processed and deleted). Leťs try 
getting the two engt -9 coordinates together... | 


NAMESA. REV | ieee: 
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It appears that we’ve compared all rows Sof NAMES 
against all rows of SEV, much ike fos 


Aco.=B. 


compares all ene of A againstallelements of 


B. Performing an or-reduction em the UGK CoO- xi 


ordinate returns the cesreg result: R N T - "i E I Sh Seo a a 


VANAMESA, =RSEV_ 
13) 4 


Alternatively, we can interchange arguments and 
perform an or-reduction along the last dimension: 


To generalize somewhat, given a ector Jy, 
the expression ; 


v/Ac.=V aA EQUIVALENT. TO Aey. 


determines whether the individual elements (scar g 
lars) of array A existin V. The o pocine exDressiop : 
z for, matrices İS ` 


Wwhich*c given : a matrix M, dereire Meinar the 


individual rows (vectors) of array B (having the 
same number of columns as M) existin M. 
The Whizbang in the next issue of News will 
discuss techniques for determining not. only 
whether an object exists in an array, but where it 
exists. See if you can anticipate me by writing a 


V/SEVA. =QNAMES : matrix equivalent for dyadic iota (VEC 1A). 2 na i 
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O | would like more information about APL*PLUS Service for the application 


O | would like to order an STSC publication. Please send me a Publication Order Form. 
O Please register me for the next three-day Introductory APL class in city 


O Please add the individuals listed below to your mailing list and send them a copy of this newsletter. 


O Please correct my address and/or telephone number as shown below. 


O Please remove my name from your maang, list. - 
Comments: 
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International 


New Publications Aid 
International Marketing Efforts 


In January 1978, APL* PLUS Service was made Areik 


able in Europe through offices located in Düsseldorf, 


London, Madrid, and Paris. With this expansion of the se 
marketing force, the need for publications, in, the. Ta 


language of each country became apparent.” 


Nancy P. Rosebush, International Publications ees 
Coordinator, joined STSC’s Publications Department. 


in June 1978 tocoordinate the translation and produc- 
tion of publications in languages other than English. 
Since that time, she and the staff in the European 
offices have produced the following publications: 


e QUICKPLAN™ Promotional Brochure. De- 


scribes how the APL*PLUS Quick Planning and 
Reporting System aids corporate planning and 


reporting processes. Available in arenei, Gers 


man, and Spanish. 


e CMCS™ Promotional Brochure. mid the e 
major applications available with the APL*PLUS | 
Comprehensive Manufacturing Control System: 
forecasting, inventory management, production ~ 


scheduling, materials requirements planning, 


shop floor control, and warehousing and distr: ae. 


bution. Available in French. 

e French and German Company Brochurest 
Glossy brochures introducing the French and 
German companies to their apee communi- 
ties. 

e EMMA™ Fact Sheet. Introduces the EMMA 
Data Management System, a system that simpli- 
fies the management of large amounts of data. 
Available in German. 

Copies of these publications can be obtained free 
of charge from APL*PLUS International, New York, or 
from the appropriate European office. (Addresses for 
all offices are listed on the back of this newsletter.) 

Translations of other publications are already. 


underway. Watch “Just Off the Press” in future issues 


of this newsletter for publication announcements. 


APL79 Conference Shaping Up 


The APL79 Conference, to be held in Rochester, New 
York, from 30 May through 1 June 1979, will feature 
contributed papers on all aspects of APL as well as 
invited papers. It is the eleventh in a series of inter- 
national conferences sponsored by STAPL, and the 
first to have invited papers. 

STAPL, an international group of APL profession- 


als, is affiliated with the Association for Computing ~ 


Machinery. Besides sponsoring conferences like 


APL79, STAPL publishes APL Quote Quad, the princi- Ta 
pal technical journal serving the APL community. The __ 


current Chairman of STAPL is Dr. Philip S. Abrams, Vice 
President-Operations of STSC. - 

According to APL79 Program Chairman, Professor 
Paul Penfield, Jr., of the Massachusetts Institute of 
Technology: S eA 

The purpose of the invited papers is to penean the 

perspective of APL users, allowing them to see their 

ianguage in a context that includes other languages, 
nontraditional uses of computers, and applications that 
may not be well suited to APL. We have extended 
invitations to an outstanding group of people, mostly ~- 


outside the APL community. Topics to be covered in 


~- these papers include data models, general arrays, 
operators in APL, relations between APL and other ~~ 
languages, and uses of computers oe tnan RE E 
traditional algorithmic style. | dec 


As for the contributed papers, over 200 Arit | 
were reviewed in September by a group of knowledge- — 
able referees and by the Program Committee. Two-. 
thirds were selected for the next step, and the authors | 
of those abstracts were asked to submit complete 
papers. The papers are now being reviewed, and the 
Program Committee will select the best ones for- 
presentation at APL79 and for publication in the ~— 
Conference Proceedings. Penfield reports: “We were ` 


_ pleased with the abstracts and alte the final Cee = 


to be of high quality.” 

Registration information can be SRATI ON 
Lynne C. Shaw, APL79 Registration Sale tO K 
Box 40004, aU? new Melis 46023 gree 


by Roy A. Sykes, Jr. 


The “All But” Case 
Author's Note: (II 0+1 throughout. 


Most people think positively. For arte most would 
ask “How many customers pay on time?” rather than 
“How many customers are not delinquent?” Ina similar 
vein, an APL programmer is more likely to write 
(BAL<0)/CUSTNOS than (~BAL20)/CUSTNOS 
to select all customers with negative balances. But 
occasionally programmers are asked to solve “all but” 
problems, forcing them to think negatively. 

Fortunately, APL provides facilities to allow either 
direct coding of negative inquiries, or their transfor-- 
mation to positive requests. The example above is 
illustrative. Perhaps the most direct “all but” function is 
drop (+). The statement 5 2 +MATRIX can be 
interpreted as “Return all but the first 5 rows and 
last 2 columns of MATRIX.” — 

One characteristic of most practical “all but” at 


H lems is a limited universe of discourse. That is, we can 


generate the first five positive integers (15), but we ~ 


can’t generate all but the first five positive integers” | 
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(5+ıL/10) because, practically speaking, it's an 
infinite set. 

We normally need a fixed and manipulable popula- 
tion which can be divided into two groups — the 
“haves” and the “have nots”, if you will. 


Some Examples 


Consider the following problem: Saies regions are 
stored in a database as single arbitrary character 
codes with an associated two-variable directory 
(SRCODES isthe vector of codes; SRNAME‘S is the 
matrix of names). Marketing reorganizes and adds 
three new regions. Assign the next three sales region 
codes. 
The practical solution is 
NEWSRCODES+3+ (~DAVeSRCODES) /DAV 


because DAV is small enough to manipulate and is 
divisible into two groups. The interpretation is “Return 
the first 3 elements of all characters, except those 
already used.” But whatif SRCODES contained posi- 


tive integers instead? Even if we could generate all = 


positive integers (API+1L/10), it would be mon- 


strously impractical to attempt seeks bor 84 | 
NEWSRCODES+3+(~APIeSRCODES)/API — 
Instead we would resort to some other algorithm, such 


as oe ee ARETE, 
NEWSRCODES+(13)+[/0,SRCODES 


A similar problem is finding an arbitrary tie number 
for a file. The tie number will be one of the positive 
integers that is not already tied to a file, as given by 
(JFNUMS. Our universe of discourse is all integers 
between 1 and 1+2%*31, which is plainly too big to 
manipulate. One approach is l 

TIENO+1+[/0,DFNUMS 


but this fails if ( 1+2*31)eDFNUMS — that is, if 
another file has already been tied to the largest 
allowable value. 

The solution is to limit the universe of discourse to 
some practical size. Consider: if there are N files tied, 
at least one of the numbers 1 N+ 1 is not being used as 


a tie number. Thus we can use 11+pL]FNUMS asour 


universe: SOPR T (i eyes 
U+11+pOFNUMS © TIENO+1+ (~UVeDFNUMS)/U 


Or, we canbe more succinct by using dyadic iota (1 ) 
to automatically return the next higher integer, if 
required: : PMs es S 
TIENO+«((1pDFNUMS) «0FNUMS)10 
“All But” Selection 


For compression (/) and take and drop (+ and +), “all 
but” selection is easy: 


B/VEC All elements where B=1. 
(~B)/VEC All elements, except those where 
B=1 (i.e., where B=0). 
N+VEC The first (or last, if W<0) W ele- 
ments. — 
N+VEC All but the first (or last) W elements. 


However, subscription doesn’t have a direct “all buti 
formulation. Indices represent not only selection criteria 


6 News November/December 1978 — 


but also ordering, replication, and structure information. 
For exampie, what is the “ali but” case in the following 
example? 


VEC+"BUSTLE' . 


INDS* 24p465516 5 4 
VECLINDS) . | eeu. 
TELL 
BELT 


Plainly we have to form some rules for “all but” indexing. 

Let us define “all but” indexing as selecting elements 
from an array in the order and structure in which they 
occur in the array. Further, let us state that each element 
selected is exactly one of those whose indices do not 
occur in the given set of indices. It all adds up to a case of 
compression: i Sg aa ye 

VECL (~(1pVEC)eINDS)/\pVEC] = 


Or, more directly, 

(~(1pVEC)eINDS)/VEC 
US sie. aa z 

Although the latter case is preferable for selection, 

the former is necessary for assignment: 

VECDL (~(ipVEC)e€INDS)/i1pVECI+*'E' 

VECH a S He 
BEETLE 


if pV EC is large, the epsilon (€) search for ıp VEC 
will be expensive. Thus an alternate approach for com- 
puting the compression vector is desirable: = = 
ALLBUT+(pVEC)p1 © ALLBUTLINDS]+0 ` 


We construct a Boolean vector that selects all ele- 
ments of V EC. We then assign 0’s to the indices we do 
not want to select (the same element of ALLB UT may 
be assigned several 0’s). ALLBUT is identical to the 
original compression vector. ? 


ALLBUT © ~(ipVEC)eINDS 
04" 1-0 ..0- 8 
O11 


00 0 
ALLBUT/VEC 
EE 
VECLALLBUT/1pVEC]+'‘AT! 
VEC 
BATTLE 


Another Application 


Finally, an unusual but useful “all but” application is 
determining all functions in a workspace except those 
specified. For example, if you have a group defined that 
contains the names of all subroutines required by a 
particular function, before invoking that function you 
may want to erase all other functions to increase 
working storage (OW A). l 
The traditional approach is clumsy: - 


yY TOGO+ALLFNSBUTGRP TOKEEP;COLS;ALLFNS 
Cij ALLFNS*ONL 3 © TOKEEP+OGRP TOKEEP 
es COLS+(1+pALLFNS)[itpTOKEEP 
[3] ALLFNS+((1+pALLFNS) ,COLS)+ALLFNS 
[4] TOKEEP+((1+pTOKEEP) ,COLS)+TOKEFP 
[5] TOGO+(A/ALLENSY .#QTOKEEP) 7 ALLFNS 

y cmi 


ALLFNSBUTGRP accepts a group name and returns 


_ the names of all functions not contained in the group. 


Sl AR i ra 


Line [5 } searches and returns all function names in 
the workspace that do not occur in the list of names to 
keep. 
But the innovative approach is truly bizarre: 

V TOGO<AFBG TOKEEP;LOCALFN 
ep TOKEEP+,';',U0GRP TOKEEP 
be TOKEEP+'VLOCALFN;LOCALFN' ,TOKEEP 
[3] eODEF TOKEEP,OTCNL,'C1]70GO+NNL 39' 

y NE 


AFBG defines a local function that localizes the names 


of all objects to keep. When the local function is = == 
executed, it returns the names of all functions definedat ~ 
that moment, which excludes the localized namesinthe . ~ 
header! Note that, because of shadowing, neither -= 


ALLFENSBUTGRP nor AFBG can return names of 
functions homonymous with names in its header. 


More on Dyadic lota | | 

Jordan Plitteris (Clairol) took a different (and better) tack 

when he solved the matrix dyadic iota problem from the 

September/October issue of News: 
DIO++7A\~MATRIXA, =QVECMAT 


VECMAT can be a vector or matrix anions last coordi- 
nate is the same as that of MATRIX. We can modify 
this basic algorithm to allow arrays of any rank in place of 
VECMAT and to substantially improve its spoe 


_ DIO++/A\ARRAYV .Z®MATRIX = 


Notice. that, with minor nodificatlone: this weaned 
TDR can De, SEE tO gente dyadic ae on — 
vectors: seen wate ke eG Se ee a T. 


č packs 


C110++/n\ARRAY*. 2VECTOR | 
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shipping terminal. A package going 150 miles or less 
will arrive the same day; a package going between 150 
and 500 miles (delivered to the shipping terminal any 
time on any given day) will arrive overnight. A package 
going between 500 and 1000 miles will arrive in 24 
hours, and a package going coast-to-coast will arrive in 
three days. When the package reaches its destination 
teminal, the consignee is notified by phone, or by 
postcard if he cannot be reached by phone. 


Rates 


Bus shipment is one of the least expensive shipping 
services available. Depending on distance, the costs 
are 10 to 60% less than the costs of air freight and 
airline small package services. (Charges for delivery 
to, or pickup from, the bus terminal via local courier are 
additional.) 


insurance 


Insurance is available at no charge for shipments with 
declared values of up to $50.00. Additional insurance 
can be purchased for a small charge on shipments of 
higher value. For fileprints, magnetic tapes, or 


punched cards, we think the minimum insurance is 
sufficient, provided a backup copy of the material is 
maintained until the shipment is received. 


Limitations 


Packages can weigh up to 150 pounds each, with 
three packages to a shipment, for a total weight limit of 
450 pounds per shipment. Any number of shipments 
can be made. 


Tracking Lost Shipments 


Tracking is good, though not as prompt as the over- 
night freight companies’ tracking systems. Results are 
usually obtained within 24 hours. 


Reliability 
We do not hesitate to recommend bus shipment, 
except when delivery time is crucial. Our assessment 
of the reliability of each service is given below. 

è Trailways’ Package Express - Very Good 

@ Greyhound’s NBO - Good 

For more information on shipping by bus, contact 
your local bus terminal listed in the Yellow Pages under 
“Express and Transfer Service”. 
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by Roy A. Sykes, Jr. 


Numeric Sorts 


While designing (in collaboration with Martin Gardner, 
STSC) and implementing the functions in workspace 
6 SORT, | investigated the properties of the APL 
grading functions (A and Y) and their application to 
sorting. This article will present a short tutorial on 
sorting (reordering) numeric arrays in APL. All exam- 
ples are in origin 1(DI0<1). 


Sorting Vectors 


Grade up ( A ) and down ( ¥ ) are marvelous concepts, 
although their definition is limited to vectors. Rather 
than actually sorting a vector, they return the indices 
that would sort the vector. For example, 


VEC+ 70 46 18 46 59 32 46 59 
O+INDS+AVEC 
ie ee Be 


The result is a permutation of 19 VEC indicating that 
the third element of VEC is the smallest, the sixth 
element is the next smallest, and so on. Consequently, 
we can use the resultant indices with subscription to 
rearrange (sort) the vector in ascending order: 


VECLINDS } 
18 32 46 46 46 59 59 70 


Why doesn’t grade order a vector directly, rather 
than requiring the subsequent indexing operation? 


The answer is that grade, being a more fundamental 
operation to express ordering, has a variety of other 
uses such as ranking and searching. You can also use 
its result to rearrange other arrays on the basis of the 
ordering of a corresponding vector. For example, given 
a matrix of data, you can order the rows of that matrix 
based on one column (in this example, the year’s 
column): 


DATES DATESCADATES( 31];] 
EFTS 10 31 1978 10 31 
1978 12 2 1978 1-2 2 
1979 1 5 LITO 5 30 
2978 11 30 1978 Be t3 
1979 1 16 1978 Et 30 
1978 12 i 1978 i2 5 
1978 rA 30 1979 4 5 
1978 12 5 1979 1 16 


Sorting Matrices on Keys 


More often we want to sort a matrix based on several 
columns. In the previous example, we sorted based on 
only the first column (years) and ignored the second 
and third columns (months and days). To order 
DATES properly, we want to rearrange by year, and if 
year is identical, rearrange by month, and if month is 
identical, rearrange by day. Another way of saying this 
is that year is the major sort key, month is the inter- 
mediate key, and day is the minor key. 

There are several ways to perform key sorts. One 
way is to sort by the minor key first, then by the 
intermediate keys, and finally by the major key: 


V R+DATASORT MAT;COL; DIO 
[1] DI0+1 © COL+(pMAT)[2] 
[2] A:*COL+B © MAT+MAT[AMAT[ ;COL];] 
Lar COL00L-L 6 4 
[4] B:R+«MAT 


V 
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The above technique represents a brute-force ap- 
proach, however. At each iteration we must rearrange 
the entire matrix; this requires a new copy of the matrix 
and consumes unnecessary CRUs. The more sophisti- 
cated approach given below rearranges only the 
column being sorted and the resultant indices. It 
defers rearranging the entire matrix until the final 
indices have been computed: 


V R+INDSORT MAT;COL;UI0O 
ETG ODI0+1 © COL+(pMAT)C2] O R+11tpMAT 
[2] A:>COL+B © R+RLAMATLR;COL]] 
3 COL+COL-1 © 7A 
[4] B:R+MATLR;] 
y 


R+DATASORT DATES 9 S+INDSORT DATES 
A/,R=S OR 
1 
1978 10 31 
1978 fi 30 
1978 T2 30 


1978 12 2 
1978 Te 5 
1978 12 15 
T9F9 t 5 
T979 1 16 


f N is the number of columns in the matrix, 
DATASORT subscripts NxN+1 columns of data, 
while TNDSORT subscripts only Nx3 columns of 
data. For a three-column matrix such as DATES, 
INDSORT subscripts 75% of the elements that 
DATA SORT does; for a ten-column matrix, only 27%. 
INDSORT can be further refined by computing the 
grade indices of the minor key directly, rather than 
using the index vector (11 tpMAT ) for the first iter- 
ation. The function NSORT in workspace 6 SORT 
uses this technique. 


Encoding Keys 


An alternative approach that is typically faster, but less 
general, is to arithmetically combine the keys into a 
single vector. Because of finite numeric precision in 
the computer, this approach is usable only if the 
resultant keys are all less than 2x56 (approximately 
7 .2E16). A typical formulation for positive integers 
is 
MATLA(0,.# O 1 +MAT)1QMAT; J 


This expression computes the minimum values that, 
when used as the left argument to base value ( 1 ), will 
uniquely represent rows of the matrix. 

Most of the time, we Know in advance the maximum 
values of each column, and we can embed constants 
directly into the code. With our DATES matrix, we 
might use either of the following as the argument to 
grade: 


1O0O0LQDATES 
19781031 19781202 19790105 19781130 
19790116 19781215 19781130 19781205 


0 12 31 LQDATES 
796157- 736190 730225 736187 736235- 736203 
736187. 736133 


We map the key values into distinct numbers such that 
the minor key is represented in the low-order digits, the 
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major key in the high-order digits, and the intermediate 
keys in between. 


Controlling the Order 


In all cases, the order of the sort can be controlled by 
using either A (for ascending order) or Y (for descend- 
ing order). Alternatively, because of the following 
identities, we can use only Å by multiplying the data by 
1 (for ascending order)orby 1 (for descending order). 


YA <+> Å-A <> AAx 1 


In fact, we can sort different days in different orders 
using these identities. For example, to sort DATES in 
descending order by year, but in ascending order by 
month and day, we can use any of 


R+|INDSORT DATESx(pDATES)p 1 1 1 
S+DATESCLA 0 12 31 1XDATES;] 
T<+DATES([L ADATES+.x (10000 100 1;] 
(A/,R=S)AA/,R=T OR 

+ 

1979 1 5 

1979 1 16 

1978 10 31 

1978 11 30 

1978 11 30 

1978 12 2 

1978 12 5 

1978 12 15 


An Unusual Technique 


Yet another technique for sorting an integer matrix M 
is the elegant algorithm shown below: 


R<M,0 © R4M[Ą+#R<.-XR;] 


Terse, and completely general, it is nonetheless quite 
slow in most cases. (An exception is “short, fat matri- 
ces” -those having very few rows and many columns.) 

The unusual inner product determines whether 
each row of M is smaller (1) or the same or larger 
(0) than every other row. It does this by performing a 
less-than reduction (< /) on the arithmetic differences 
between all pairs of rows. The </ ignores all leading 
zeros except for the last element, thus requiring the 
catenation of 0 (actually, any numeric scalar will work) 
to the matrix being analyzed. If the first nonzero 
element is negative, the </ returns 1; otherwise it 
returns 0. 


Another “All But” Application 


In the November/December issue of News, | pre- 
sented several techniques for performing “all but” 
calculations. Here is another that pertains to 
OFSTAC (setting file access), but may also be ap- 
plied to DIDLIST. 

The permission codes in column 2 0f LFSTAC 
are based on the sum of access codes, which are 
powers of 2. There is a fixed number (N<1i4) of 
distinct access codes at any one time, but / may 
increase in the future if new types of file access are 
defined. There are two approaches to setting file 
access: 


(1) Grant only the given access (additive): 
+/CODES 


A A A A A A LT ce; <ceE 


This technique is simple. Just add up the desired 
access codes to compute the permission code. For 
example, to give [FREAD (1), UFAPPEND (8), 
and FRESI ZE (1024) access, use the code 1033. 


(2) Grant all but the given access (subtractive): 
~1-+/CODES 


This approach requires that you either know N (the 
number of distinct access codes) and add up all codes 
other than those you wish to grant, or that you know 
that permission codes are actually interpreted as 
PC<+(2*N)|PC. 

To use the additive technique to give all but 
UF ERASE-.(+), GOFDROP (32), FF 16/17 
(256), and JIFSTAC (8192) access, you have to 
add up the codes for everything else! 


P71 2- 8 16 6# 128 Si2 7028 204s 4096 
7899 


The subtractive technique is more direct, and you don’t 
have to know Ñ: 


~1-+/4 32 256 8192 
~ 8485 

(2xN)| 8485 
7899 


We see that a permission code of 8485 is equiv- 
alent to 7899. The subtractive technique has the 
advantage of being independent of the number of 
types of access available. Moreover, it automatically 
grants any future access (as when WV increases to 
enable more types of access). | often give myself 5 
access (allbut UF ERASE )tomyownfiles to force me 


to perform an additional safety step before erasing a 


file (that of resetting the access matrix), and | com- 
monly use LJIJDLIST 9 to list all names except 
labels. 
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Other Sorts 


In the last issue | discussed sorting numeric matrices 
using the columns as keys. This month I'll discuss 
sorting character data and sorting rows independently. 
As usual, all examples are in origin 1 (LIZ O<1) unless 
otherwise noted. 


Sorting Character Vectors 


Grade up (A ) and down (Y ) operate only on numeric 
vectors. To sort character data, the data must first be 
transformed to numeric data. The normal technique is 
to use index-of (dyadic iota, 1 ) to map each character 
to its position within a defined collating sequence: 


LET<'FEEDBACK' 
LETULALET] 
DOMAIN ERROR 
LETLALET } 
A 
CS<«'ABCDEFGHIJ KLMNOPQRSTUVWXYZ' 
GEEET 
o e a 5-7 r S ad 
LETCÀCS:ı1LET] 
ABCDEEFK 


Sometimes the collating sequence is unusual: 
'PMCE' for continental U.S. time zones, 'CDEFGAB' 
for the musical scale, 'PBNRQK' for chess pieces, 
'TVXLCDM' for Roman numerals, and 'CDHS' and 
1934567897JQKA' for card suits and denominations. 
For example, let the following matrix represent a 
bridge hand: 


HAND 
K45QA3Q8J2TT7J 
HSCCHDSSSHCSC 


We can sort the hand by suit alone, 


HANDL;:¥'CDHS'1HAWNDL2;]] 
4Q8I7KA235QTI 
SSSSSHHHDCCCC 
by denomination alone, 


HANDLE: ¥'23456789TJQKA'1 HAND([1; ]] 
AKQQIJT875432 
HHCSSCCSSCSDH 


or by denomination within suit, 


D<¥'23456789TJQKA'1.HANDL1; J 
HANDL;:DLY'CDHS'1HANDL2;D))) 
QIBTUAK2Z3Q7T5 
SSSSSHHHDCCCC 


The last example illustrates the basic technique of 
sorting a character matrix by keys. The minor key is 
denomination (the first row), and the major key is suit 
(the second row). In this case, each key has its own 
distinct collating sequence. 


Sorting Character Matrices 


Most often we simply want to sort a matrix of words or 
names. The collating sequence is usually the alphabet, 
often prefixed with a space and punctuation marks and 
suffixed with the numeric digits: 


£8 
- / ABCDEFGHIJ KLMNOPQRSTUVWXYZ0123456789 


We can modify the JVDSORT function presented 
in the last issue to accommodate character data by 
simply translating the characters to numbers on line 
Bee 


V R<CS CINDSORT MAT;COL;QIO 
ESS OL0<1 © COL«(pMAT)L2] O ReiiltpMAT 
[2] A:*COL+YB © R+RLACSiMATLR;COL)) 
[3] COL+COL-1 © +A 
C4] B:R<MATLR; J 


WASHINGTON 
OREGON 
CALIFORNIA 
NEVADA 
CS CINDSORT WEST 
ALASKA 
CALIFORNIA 
HAWAII 
NEVADA 
OREGON 
WASHINGTON 


However, this approach tends to execute slowly be- 
cause it must iterate for every column of the array. 
Since we know in advance that the largest number we 
may encounter is 9 C'S (assuming the collating se- 
quence contains all possible characters), we can prof- 
itably encode several columns into one key using base 
value (1 ). By doing so we can reduce the number of 
iterations required. In fact, were it not for finite numeric 
precision within the computer, the following expres- 
sion would sort the rows of any character matrix. 


CHRMATLA(pCS)1CS1QCHRMAT; |} 


But the largest unique integer that can be distinctly 
represented on the APL*PLUS System is 2*56. If 
the result of base value exceeds that, precision is lost 
and words like “individualist” and “individualism” be- 
come indistinguishable. 

We must therefore calculate the maximum number 
of columns we can encode into a single integer. The 
derivation of MVOC, which must be integral, is as 
follows: 


((pCS)1MNOCp 1+pCS) S- 2*56 
(~1+(pCS)*MNOC) < 2*56 
((epCS)*MNOC) < 142456 
((pCS)*MNOC) < 2*56 
MNOC < (pCS)®2*56 
MNOC = L(p@S)®2*56 


eCS O O+MNOC+L(pCS )@2*56 
39 
10 


This formula assumes values from dyadic iota in the 
range 0 to 1+pCS. These can be achieved by ei- 
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CONTEST - An Unusual Technique 


The function CSORTR in workspace 6 SORT uses 
a completely different approach to sorting character 
vectors or rows of character arrays. The technique is 
distinguished by the fact that it uses very little inter- 
mediate storage, and is quite fast. Documentation can 
be obtained by executing 


UTILHOW 'CSORTR' 


in any Library 6 workspace. 
A dinner for two people awaits the person who 
writes a function named CSORTRX which 
1.matches the documented performance of 
CSORTR 
2. uses about the same static (USIZEF 'CSORTR') 
and dynamic (OWA required to execute) storage 
as CSORTR 
3.is faster (requires fewer CRUs) than any other 
entry. 


The benchmark will be 
)LOAD 6 SORT 

SAI o a 
y T+F00;C;D;CSC;CSD; CS 
[1] C+ 1000 8 pfIDLIST 4 O CSC+«(DAVeC)/DAV 
[2]  D«UTILHOW 'CSORTR' © CSD<«(OAVeD)/OAV 
ES] T<QAIC2 ] 
[4] CSSC O Cti CSORTRX C 
C5]  CS-CSD © D+1 CSORTRX D 
[6]  T+DAIL2]-T 

y 
Oh yes, one last minor condition: 

4. The M and J keys on your terminal are broken 
(both uppercase and lowercase), so you can’t use 
them in your solution. 

Entries must be postmarked no later than 31 May 
1979 to qualify. Address entries to Roy A. Sykes, Jr., at 
the Los Angeles office address given on the back of 
this newsletter. Or send entries to ROY via the STSC 
Mailbox. The winner will be announced in the 
July/August issue of News. Have fun! 


Royal Canadian Yacht Club and several other yacht 
clubs to score over 100 large regattas. 

The versionof SAILSCORE used in Newport scores 
“one-design” races, in which yachts of the same de- 
sign compete {as in the Olympics). Another version of 


SAILSCORE scores handicap races, in which boats. 


of different types and sizes compete. The latter version 


of SATLSCORE is particularly suited to scoring long- 


distance offshore races for yachts over 30 feet long. 

Jan emphasizes the interactive nature of 
SAILSCORE, which makes it extremely easy to use. 
He refined the program for this year’s Olympic Regatta, 
based on experience gained over the last several 
years. Because of this sophisticated software and the 
excellent response time on STSC’s APL*PLUS Sys- 
tem, contestants received their standings in a fraction 
of the time that it usually takes to post racing scores. 

SAILSCORE and STSC wiil also be at the 1980 
Olympic Trials. By donating software and computer 
time, Jan and STSC are making a big contribution to 
the U.S. Olympic effort. 


Man ulacturin G -° 


Manufacturing System Available 
To Run on Inhouse Computers 


STSC recently announced that its Comprehensive 
Manufacturing Control System (CMCS™) can now be 


purchased along with the APL*PLUS System software _ 
by companies that wish to run the application on their — 


own inhouse computers. The only hardware require- 
ment for the customer’s inhouse computer is that it be 
plug compatible with an IBM System/370 computer. 

Until now, the modular library of CMCS programs — 
covering most materials management operations in- 
cluding forecasting, inventory management, materials 
requirements planning (MRP), and shop floor control — 
was available only on STSC’s time sharing system. 

Because CMCS is also available in a time sharing 
mode, the responsible manager no longer has to feel 
irrevocably committed when deciding on a major soft- 
ware purchase; he can “rent” it first, so to speak, ona 
time sharing basis. In the time sharing mode, the user 
can test the system from top to bottom, trying every 
option and sampling each report. When he is satisfied 
that the system works for him in his operating environ- 
ment, he can make an informed purchasing decision 
with the element of risk virtually eliminated. 

For each purchaser, STSC wiil configure a system 
to include just those portions of CMCS that the cus- 
tomer has found useful and necessary by actual exper- 
imentation. Thus, each customer can buy only what he 


really needs to run his business, with none of the 
expensive extras that he might never use. It’s a rare 
product in this day and age that can be tested and 
customized before purchase. 

For more information on how CMCS can heip solve 


your company’s materiais management prcbliems, | 


contact our manufacturing services softice listed on the 
back of this newsletter. 


)ORIGIN 0 


This Whizbang is different pom its predecessors = its 

examples are written with DIO (the index origin) set to 

Os 
OT0<0 9 19 

0.4 2 B14. 5.61.8 


Along the way, we'll discover several practical ap- 


plications of this useful origin. First of all, oniy the 


following primitive functions are affected by the origin: 


ODR+A1B. 


ODR<+1iA 

ODR<?7A ODR<A?B 
ODR<+hA ODR<YA 
R+ALOSA] 
R<«OLOSAIA R<AOLOSAIB 
R<«+A/LOSAJ]B R<+A\LOSAJB 
R<+0/CLOSAJA RON OSA ge 
R<+A,LOSAJ]B R<OSA®A ` 


The name ODR signifies an TETI result; 
OSA signifies an origin-sensitive argument. ODR 


means that the result of the function i is affected by the 


setting of DIO; OSA means that the range of permis- 
sible arguments is affected by the setting of DIO. 
This distinction between dependent results and 
sensitive arguments is partially academic, however. 
Consider the following problem: Generate the value 


10 if DIO is1and 10 if DIO is 0. Beginners will - 


often struggle and come up with something like this: 
10xxDIO-0.5 a 11 CHARS, 3 FNS, DIO REF 


An experienced coder might produce: 


10x 1x~0DIO aA 10 CHARS, 3 FNS, UIO REF 


Some hotshots may even do some quick analysis and 
come up with a first-degree polynomial: 


O70120 19 a 10 CHARS, 1 FN, DIO REF 


But an aficionado, one who prizes APL for its elegance 


and terseness, yet recognizes and minimizes its cost, 


would code it as Oia 
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y r r y 
sE Fe ie § : 


10 10[1] a 9 CHARS, 1.78: 


To get back to my original point, although indexing 
(subscription) is considered to have an origin-sensitive 
argument but not an origin-dependent result, few 
would argue that the result of the above case is not 
origin dependent! 

What are some of the less obvious ramifications of 
origin 0? For one, a trusty standby for branching 
doesn’t work: 


+LABELx1CONDITION oo (E.G., *Lx1M2C+C+1) 


In fact, we discover it either branches to 0 or falls 
through, and the multiplication (and hence the value of 
LABEL) is irrelevant. In origin 1,11 produces 1 (as a 
one-element vector) and1 0 produces an empty vector. 
But in origin 0,1 1 produces 0 (again as a vector), while 
10 still produces an empty vector. When users catch 
‘on to this unpleasantness, they quickly change to 


+LABEL[ 1CONDITION 


which works in either origin. (*CONDITIONpLABEL) 
and +CONDITION/LABEL also workin either origin, 
and are usually preferable.) 

Another strange effect of origin 0 is the behavior of 
dyadic iota (1), particularly with elements not found. 


pA+ 649-129 6. 9-109 
g : 

r E Ter s E 3435 A A, 
y e os E a 


For most of us, the eighth position is number 8 (that is, 
we commonly think in origin 1); but in origin Othe 
eighth position is number 7. Do youremember the rule: 
“If an element of the right argument to dyadic iota is not 
found in the ieft argument (A), its corresponding 
position in the resultis setas 1+pA.”? Well, the correct 
formula is actually 070+ pA. That is, the indices in A 
are 104, and the “not found” flag is actually 1 plus 
the last index: 1+. 1+10A (or the origin itself, if Ais 
empty). | 

There are other interesting effects of origin 0. For 
instance, the results of +/1N and 2!W are identical. 
To laminate two vectors to form a two-row matrix, we 
must specify a negative axis! 


A,[L 0.5J0A 
6 9-6 9 10-30 
a 16249 6.98 1S OE 


Particularly useful is the fact that we can use Boolean 
results (as from relational functions) to subscript di- 
rectly, rather than adding 1: 


+(LZ1,L2)(€D>pA] a BRANCH TO EITHER 
OF TWO LINES 


' O'[Aco.>if/A] a A HISTOGRAM 


OO0000 
0 


0 
Oo0000000 
000000 
OO0000000 
OO00000000 F 
Behavior with respect to take and drop (+ and +) is 
modified when their left argument is the result of 
origin-dependent functions (iota in this case): 
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INP<+'PRINT/34'! 
Of0<1 © A+( 1+I«INPi't/')t+INP © B+I+INP 
DIO+0 O A+(I+INP1'/')+ INP Ò Be(I+1)+INP 
INPA.=A,'/',B 


Sometimes it is necessary to write a function that 
works in either origin, but in which you can’t set or 
localize O20. For exampie, a function that returns 
indices should return them in the origin of the calling 
environment. Such functions are called origin sensi- 
tive and are usually annoying to write. Some hints 
follow: : | 
è If you subscript a vector often, rather than modifying 

the index at each operation (for instance, using 

VEC[OI0+31] to select the fourth element), con- 

sider modifying the vector initially and using fixed 

origin-1 subscripts: 


VEC+ 14 13.12 2 8 21 6.10 10 25 ig 

VEC+DIO+0,VEC a OR: VEC+(1~DI0),VEC 

VEC[4 6 3] bd 
2.52142 


è If you refer to counters (C) or limits (Z) related to 
subscripted arrays, remember to adjust the count- 
ers or limits for the origin. Origin 1: 


DIO+1 © R+(L+pVEC)p 1 © +AxL2C+DI0 
A:RC[C]+ppDFREAD TN,VECLC] O +AxL2C+Ct1 


Origin 0: 


Or0+0 © R+(L+pVEC)p 1 © +AxL>C+OI0 
A:RLC]+ppDFREAD TN,VECLC] © >AxL>C+C+1 


Any origin: 


R<«(L+pVEC)p 1 © 7Ax(L+L+0I0)>C+UI0 
A:RLC]<ppOFREAD TN,VECLC] O +AxL>C+C+1 


e If youuse the +\BITVEC technique for generating 
` indices, prefixitby BITVECCOIOJ<OUIO toworkin 
either origin. 
As a rule, all functions should have one of the 
following characteristics: i 
e origin independent; thatis, DIO is localized and set 
on line [1] to a fixed value | 7 

@ origin free; that is, the function does not refer to 
OIO explicitly or implicitly (uses no origin-sensitive 
primitives) 

ə origin sensitive, as described earlier. 

If you must write origin-dependent functions (those 
that work properly in only one global origin), you should 
state this limitation prominently in the documentation 
for the function. 

Since the capability to localize the system variable 
OZO became available, | have observed that most 
functions | write have DIO localized and set to 0. 
Origin 0 usually seems more Convenient. 

For example, whenever you are dealing with num- 
ber systems, using primitives like modulus (or residue, 
| ), encode (or representation, T), and decode (or base 
value, 1), originO is the natural base. lf a number 
represents several quantities packed together, its 
origin 0 representation is consistent with the packed 
form. In the following example, we want to select 
several scattered points from a matrix M. _ 


~~ ee 


oe ee 


— peer meetin eae “Siti: elem ie iene a ate AEN STE ea I RT AE RE PI A 


Comments: 


O+M+ 4 7 910x128 
D 40 20 .<30-: 40 =:.50'°:60 
70 80 90 100 110 120 130 
140 159 160 170 180 190 200 
210 220 230 240 250 260 270 
Deye? 130 ,f°0.5) 13 4 6 a ROW,L 0.5)COL 


2 423330 
i 3 4"s 3 
(,M)CO«( pM) iN) 
15 10 25 6 


150 100 250 60 


Since the indices in Vare already in origin 0, their 
decoded form is suitable for origin-0 indexing. 
Here are some othercalculations that are simplified 
in origin 0: 
@ Compute the number of days in February (years 
1901-2099): 


29 28 28 28[4| YEAR] a OR 29-x4|YEAR 


@ Branch to different lines based on the rank of an 
array or the result of ONC: 


[3LppARRAY] 
X)CONC 'NAME' J 


- al 


è Signal only the first line of the diagnostic message if 
an error occurs: 


DFLX+'OERROR(ODMi0TCNL) pUDM'! 


è Given a matrix M with distinct elements, determine, 
as a two-row matrix, in which row and column certain 
numbers W appear (this is the inverse of the scat- 
tered-point example given above): 


(pM)T(,M)iN 


In origin 1 it would be 
1+(pM)T((.M)1N)-1 


e Branchto 0 if 'END ' isin V; otherwise, fall through: 
+iv/V OSS ‘END! 
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by Roy A. Sykes, Jr. 


A Clever Sort 


In the March/April 1977 issue of this newsletter, | 
presented a series of string-searching algorithms cul- 
minating in a function (SS6 ), which continually nar- 
rowed the search until completion. SS6 was typically 
fastest because it searched the fewest possible num- 
ber of elements at each iteration. Although SS6 was 
the most complicated function presented, its algo- 
rithmic sophistication outweighed the code complexity. 

In this issue, | present another such algorithm, this 
one to grade numeric matrices. The function pre- 
sented. ere is derived from a function to grade charac- 
ter matrices written by Dr. Ross Hamsher, a systems 
analyst at Thermo Electron Corporation. The funda- 
mental algorithm has been known for many years; Ron 
Murray wrote a version in APL at STSC seven years 
ago. But Dr. Hamsher’s function represents a major 
and elegant simplification of the algorithm, with noloss 
of generality. 

The naive approach to grading a numeric matrix is 
to repetitively rearranage a permutation vector, start- 
ing from the last column (the minor key) and ending 
with the first column (the major key). For simplicity, we 
will use origin 1 in NMGU (numeric matrix grade up). 


V R«NMGU MAT;CI;0I0 
[1]  DZIO+1 © ReriitpMAT © CI+(pMAT)[2] 
2J = ACIO 
[3]  ReRUAMATUR;CI]] © CI<CI-1 © >A 

v 


Line [1 ] computes the initial permutation vector ( R ) 
and the index of the first column to grade ( CI ). Line 
[2] escapes if CI is 0. Line [3] rearranges the 
permutation vector according to the next key (column), 
decrements the column index, and branches back to 
the test on line L2]. 

NMGU has the virtue of simplicity, but it is totally 
insensitive to the characteristics of its argument. For 
example, consider the following matrix: 


MAT 
617 267 6864 462 305 
301 pee meee SF eos © Sear! oe OAS Seca Se SS 
213 SHOE S61 1-22 243 240 
713 o9i=-- 6319 > 7000 700 


Z2 tals, Saws 747 O 

312 moO. 7/600 15580 0 

301 Ee Sere - as | See ET 401 
NMGU MAT 


le: SS care O Saar 


MATL NMGU MAT; } 
dae Er 731-3905 747 0 
213 340 4611 21243 240 
301 Bae Sage po" teers £: BES 401 
301 You + E 3 - 7707 F414 
312 v0 7600 1550 0 
617 267 6864 462 305 
713 CIE - 6373. Fu 700 
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NMGU dutifully processes all elements of every col- 
umn. However, without much inspection it’s easy to 
see that we really need only examine the first two 
columns of MAT todetermine the grade vector. In fact, 
the only reason we need to look at the second column 
is to resolve the duplicate 301 values in the first 
column. | 

This technique - sorting each column only to the 
extent necessary to resolve duplicates in previous 
columns - is intuitive to most people. It is somewhat 
more complex to describe. 

Inthe function VMGRUP presented below, a Boolean 
vector ( U ) flags row indices ( R) of MAT whose rela- 
tive ordering is so far unresolved. A partition vector 
(PV) is employed to segment the flagged rows 
(MATLU/R; 1) into groups of equal value. 

At each iteration the value of another column is 
used to permute unresolved elements of the result 
within their partition. Typically, this process complete- 
ly decides some ties and further segregates others. 
Hence some partitions disappear (become singular) 
while others break into several smaller partitions. The 
process continues until either all columns are exam- 
ined, or all partitions disappear. 

The function and a detailed description follow. 

V R«NMGRUP MAT:;D;G;N;U;CI;NC;PV;UR;DIO 
ee DIO+1 © Rei N*1tpMAT © +(NC+(pMAT)[21)¥4¥0 
[2] PV+Nt1 © U+Np1 © CI+1 
[3] A:G*hD<«MATLUR*U/R3CI) © G<GLA(+\PV)[G]] 
[4] R(U/iNI]<UR[G] © >(NC<CI+CI+1)p0 


[5] DeD[G] O PV+PVVD4"16D © U+U\UR+PV%10PV 
[6] >AxxpPV+UR/PV 


cag Set the origin to 1. 
© Compute the number of rows (1) and the 
tentative result ( R). 
© Compute the number of columns (NC ); quit if 
there are no columns. 

p24 Initialize the partition vector (PV) toa single 

encompassing partition. 

Initialize the unresolved element flags ( U ) to 

all 1. 

Initialize the column index (CI) to 1. 

Select unresolved row indices ( UR )from the 

result; select the corresponding data ele- 

ments from the CIth column ( D); compute 

the ascending grade vector (G). 

© Rearrange the grade vector ( G ) according to 
the partition of each element. G nowreflects 
ascending order within each partition. 

C4] Permute the unresolved elements (U/W) 
of R (UR) to reflect the ordering of the cur- 
rent column. 

© Increment the column index; quit if all col- 
umns have been processed. 

ee Rearrange the data elements ( D ) in ascend- 
ing order within each partition. 

© Determine changes in the ordered vector 
and update the partition vector (PV ). Each 
partition will be split into one or more smaller 
partitions. 

© Flag unresolved partitions; that is, those of 
length two or more (UF). Update the unre- 
solved element flags ( U ). 


I> oD > 


[6] Select only unresolved partitions; continue 
processing only if at least one partition re- 
mains. 

NMGRUP can be further refined by diagnosing 
some special cases (matrices with less than two rows 
or columns), and by preprocessing the first column to 
avoid the generalized partition-processing code in the 
loop. 


The refinements affect only the setup (lines [1] 
through [4] of NMGRADEUP), the loop (lines [5] 
through [8]) is identical to that of VMGRUP (lines 
[3] through[6 ]). 

We have added two functions to workspace 
6 SORT on the APL*PLUS System that incorporate 
Dr. Hamsher’s algorithm: NGRADEUP and CGRADEUP 
(numeric and character). If you experiment with de- 


generate arguments such as 5 1001100, you will 
find that both perform considerably slower than naive 
algorithms because they have no opportunity to nar- 
row the sort. But with typical corporate data (such as 
names and addresses) having relatively few duplicated 
rows and early resolution of partial duplicates, the 
decrease in CRU consumption can be dramatic. Thank 
you for a clever and inexpensive algorithm, Dr. 


V R+NMGRADEUP MAT;D;G;N;U;CI;NC;PV;UR;DIO 
A DI0O+1 © N+pR+ÅD+(MAT,0)[;1] 
2] NC+(pMAT)[2] © >(2>NLNC)pO 
31 D+D[R] © PV+D# 16D © U+PVx1PV 
4] +(pPV<U/PV)+0 O CI<«2 
5] 
6] 


os 


A:G+hD«+MATL[UR<U/R3CI] © G+GLA(+\PV)(G]J] 
~REU/iNIJ<URLIG] © +(NC<CI<CI+1)p0 

J D+D[G] © PV+PVVD4 10D © U*«U\UR*+PVA1IOPV 

8 ] +AxxpPV*«UR/PV 
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Mathematical programming is used to analyze a 
wide variety of problems in industry, government, and 
the military. MPSX/370, internationally recognized as 
the state-of-the-art mathematical programming sys- 
tem, provides the following features: 

e excellent speed of optimization and reliability of 

algorithmic calculations 

@ ability to interface with databases, programming 

languages, and VM’s Conversational Monitoring 
System (CMS) 

@ ability to solve linear, integer, mixed-integer, and 

separable problems efficiently 

è continuing IBM support and maintenance. 

The added support of STSC’s Management Tech- 
nology Division (MTD) makes using MPSX/370 espe- 
cially attractive. MTD experts are thoroughly familiar 
with the development and use of sophisticated modeling 
systems, and can answer your questions whenever the 
need arises. 

The modeling systems developed by MTD are de- 
signed for the business analyst/manager as well as the 
computer programmer/mathematician. MTD has 
helped STSC customers develop systems that are 
flexible and user-oriented. These systems are data- 
driven and can handle a wide range of problems 


without any need for reprogramming. You can com- 
municate with these systems using terminology you 
are familiar with, and you can produce reports that 
summarize the results in the format you have specified. 

MPSX/370 has powerful sensitivity analysis capa- 
bilities, allowing you to determine what happens to 
the optimal solution when specific assumptions or 
conditions do not hold true, or when they are changed. 
This analysis is often the most important and expen- 
sive step in using mathematical programming tech- 
niques. With systems designed by MTD, however, you 
can often perform sensitivity calculations for less than 
10 percent of the cost of obtaining the initial optimal 
solution. This efficiency allows you to test your results 
rigorously before implementing them. 

You can run MPSX/370 modeling systems inter- 
actively at your terminal to obtain immediate results. 
The potential savings to your company of finding the 
best solutions to basic business problems make 
MPSX/370 an excellent investment. 

For more information about MPSX/370, contact 
Chris Stathes, Manager of Optimization Systems, at 
the Management Technology office listed on the back 
of this newsletter. 


by Roy A. Sykes, Jr. 


Arrays of Arrays 


One of the strengths of APL is its native ability to 
manipulate multidimensional arrays. Whereas most 
other programming languages are basically oriented 
toward scalars, APL is oriented toward rectangular 
arrays. 

Since the early 1970s, language designers (among 
them Jim Brown, Ziad Ghandour and Jorge Mezei, and 
Trenchard More) have been investigating a powerful 
extension to APL, variously called arrays of arrays, 
general arrays, or nested arrays. Under these pro- 
posals, each element of an APL array could itself be an 
array. Thus, a vector would still be a linear collection of 
elements, but each element could be a vector, a matrix, 
or whatever. And each element of those subarrays 
could, in turn, be an array. Boggling! 

Actually, these concepts are already familiar to APL 
users. For example, a file may be thought of as a 
generalized vector wherein each element (component) 
may be an array of any type, rank, or shape. A matrix of 
names is really a vector of variable-length vectors, 
which, to fulfill today’s requirement for rectangularity, 
we pad with trailing blanks. 

Using the facilities currently available, let’s investi- 
gate acommon problem in APL, that of displaying a list 
(vector) of character objects (scalars, vectors, or ma- 


trices) on a page or screen (a two-dimensional surface, 
or matrix). Let’s assume our page is 24 lines long and 
80 characters wide, that we always want to display an 
entire 24 by 80 page, and that we’re working in 
origin 1 (O70<1). 

The easiest case is displaying a vector; APL 
handles this problem directly. We have a vector of 
(nominally 1920 +> 24x80) scalars - we want a 24 
by 80 matrix of scalars. First we must assure enough 
elements to fill a page: (x/ 24 80)+VEC.Then we 
simply reshape: 


24 80 p(x/ 24 80)4VEC 


This prints the elements in row-major order; that is, 
reading from left to right, then down the page. If we 
wish to display in columns (down the page, then left to 
right), we would use transpose: 


® 80 24 p(x/ 24 B80)tVEC 


Since we'll be dealing with more complex problems 
later, let’s generalize the above two expressions (the 
underlined portions may be elided): 


SCALARS, ACROSS: 


The next problem is stated thus: Given a vector of 
16-element vectors (for example, names), nominally 
arranged as a 120 (120 +> 1920+16) by 16 matrix, 
make a 24 by 5 matrix of names (each a 16-element 
vector). To display across the page, the solution is easy 
(see Figure 1 for the result): 
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oMAT 
416-26 
24 80 p((x/ 24 5),16)4MAT 


We first assure enough “elements” (each a 16- 
element vector) to fill the page; then we simply 
reshape to the desired page size. The row-major 
ordering implicit in reshape does all we need. 

However, to display the vectors down the 
page, we conceptually need to create a 5 by 24 
matrix of 16-element vectors, transpose it, and 
reshape it to our page size (see Figure 2 for the 
result): 


24 80 p 213 8 5 264 16 p((x/ 24 5),16)4MAT 


Observe that we started with a matrix, increased 
its rank by one to express the logical structure (a 
5 by 24 matrix of 16-element vectors), inter- 
changed the coordinates representing the 
“outer” structure (that of a 5 by 24 matrix to that of 
a 24 by 5 matrix), and finally reduced the rank to 
that of our medium (a page or screen). 

Again, let’s generalize the two expressions to 
see if we can detect any parallels: 


VECTORS, ACROSS: 

24 80 p_1_2_.3_8 24 5 16 _9((x/ 24 5),16)+MAT 
VECTORS, DOWN: 

24 80 p 21 3&5 24 16 p((x/ 24 5),16)+MAT 


The final problem is a natural extension of the 
first two: Given a 20-element vector of 6 by 16 
matrices (for example, addresses), nominally 
arranged as a 20 by 6 by 16 array (20 <> 
(x/24 80)+x/6 16 ++ 1920+96) produce 
a 4 by 5 matrix of addresses (each a 6 by 16 ma- 
trix). 

This time we are starting with a three-dimen- 
sional array whose first coordinate represents 
the “outer” structure (a 20-element vector) and 
whose trailing coordinates represent the “inner”, 
or subarray, structure (6 by 16 matrices). Again, 
to display across the page, we assure enough 
“elements” to fill the page; then we increase the 
rank by one to express a new logical structure (a 
matrix of matrices), interchange the coordinates, 
and reshape to the page size (see Figure 3 for the 
result): 


pTEN 
17 6 24 
24° 80-p 1°3.2-46 8 & 5 6 16 pC(x/ ¥ 5), 6°46)47ER 


Notice that here the transpose interchanges 
the coordinates of the layers and rows because 
we want layers adjacent rather than rows; other- 
wise, the rows of each address would be spread 
across the page. 

To display the matrices down the page, we 
perform the same operations, again interchang- 
ing the leading two coordinates of the reshape 
and the transpose (see Figure 4 for the result): 

24. 80: f 3 2-34-85 A466 ot fey 83 8 AB 


We can summarize the expressions to display 
matrices: 
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Figure 1 - Vectors Across a Page 
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Figure 2 - Vectors Down a Page 
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Figure 3 - Matrices Across a Page 


MATRICES, ACROSS: 
2430p 1324 84 5 6 16 p((x/ & 5), 6 16)47EN 


fe MATRICES, DOWN: 
ENKAY $00 EASTOWNE DR CECILIENALLEE oe 24 80 p 312485 4 6 16 p((x/ 4 5), 6 16)tTEN 
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which is described in Supplement No. 2 to the 
APL+PLUS Address Management Package 
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REPLY FORM 


O I would like more information about APL*PLUS Service for the application 


O I would like to order an STSC publication. Please send me a Publication Order Form. 

O Please register me for the next three-day Introductory APL class in city 

O Please add the individuals listed below to your mailing list and send them a copy of this newsletter. 
O Please correct my address and/or telephone number as shown below. 

O Please remove my name from your mailing list. 

Comments: 
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STSC’s Network Design Systems 
Highlight Communications Conference 


STSC’s communications network specialists at- 
tended the “Communications Network 80 Confer- 
ence’, held at the Washington, D.C., Sheraton 
Park Hotel on 28-31 January. The conference fea- 
tured leading vendors of communications hard- 
ware and software services and over 70 sessions 
emphasizing new communications technology and 
policy. 

STSC's exhibit featured the ARIES™ Network 
Design and Analysis System. ARIES combines the 
latest techniques for network design and the APL 
programming language to deal with large data ar- 
rays. It has been used for over eight years by 
communications professionals in the areas of cir- 
cuit design, response time analysis, data network 
design, network management, and voice network 
design. 

Representing STSC at the conference and ex- 


hibit were Robert L. Ellis, Manager of Communica- - 


tions Network Planning, David H. Durgee, Com- 
munications Support Programmer, and Phyllis A. 
Prats, ARIES Steward. John W. Myrna, STSC’s 
Vice President — Development, also attended the 


Expansive Thinking 


When IBM released the first implementation of 
APL in 1968, they gave it the name APL\360. 
Computer people understood that ‘360’ referred to 
the then-current IBM 360 series of computers, and 
learned that APL stood for A Programming Lan- 
guage. But that funny slash ( ) — now wasn’t that 
backwards? 

Of course, APL devotees chuckled and under- 
stood — it meant APL expanded the power of an 
IBM 360. (I suppose STSC chose its trademark, 
APL*PLUS, for similar reasons ... STSC’s en- 
hancements raised the power of APL.) Thus ex- 
pansion was the first function particular to APL to 
which many people were exposed. Unfortunately, 
newcomers to APL don't have the benefit of un- 
derstanding such symbolism, and it is in their 
interests that | present this article on expansion. 


: Network 


conference as a featured speaker. His presenta- 
tion, entitled “Electronic Mail — Living in the Land 
of BOX’’, discussed the implications of electronic 
mail systems. He focused on STSC’s nine years of 
experience using the APL*®PLUS Message Pro- 
cessing System (MAILBOX) and discussed the im- 
pact that such systems will have in the future. 

A highlight of the conference was the introduc- 
tion of the OPTYMUS™ Voice Network Design 
System, a product jointly developed by STSC and 
Telco Research Corporation of Nashville, Tennes- 
see (see ““OPTYMUS Lowers Costs for Voice 
Communications Networks” in the September/ 
October 1979 issue of this newsletter). OPTYMUS, 
currently offered on STSC’s APL*®PLUS Time 
Sharing Service, allows telecommunications 
managers to design their voice networks to take 
advantage of the latest technological advances in 
voice communications, such as call queuing and 
automatic route selection. 

For those of you who missed “Communications 
Network 80”, STSC’s network design specialists 
will have an exhibit at “Interface 80’, to be held 
17-20 March in Miami, Florida. Over 250 exhibitors 
will display the latest communications hardware, 
software systems, and services, and STSC’s com- 
munications specialists will be there to demon- 
strate the many aspects of telecommunications 
design. In the meantime, a new, 44-page brochure 
describing the entire ARIES product line is avail- 
able from Bob Ellis at the Communications Plan- 
ning office listed on the back of this newsletter. 


Expansion, like its companion function com- 
pression, accepts any array as its right argument, 
a Boolean vector as its left argument, and optional 
use of the axis operator ({K]), for which we'll as- 
sume origin 1(U)Z0<1): 


R«BV\ARR (last coordinate, same as 
BV\LoppARRIARR) 

R<«BV\ARR (first coordinate, same as 
BV\L1JARR) 

R«+BV\LKIJARR (Kth coordinate, general form) 


The result is the same as ARR except that the 
Kth (or last or first) coordinate is expanded to 
length (o BV). Positions corresponding to 1’s in BV 
are filled with data from ARR; positions corre- 
sponding to 0O’s in BV are filled with 0o’s (for 
numeric data) or blanks (for character data): 


WO eS Oo 2 oes. oY 2h ae. oe RS 
it: 2 0 2 22.0 0 33.90. 68 

W<'ANTEPENULTIMATE' 

A Se Sy SE ER oS 0-8 2-8 SSE OE 8 oe 
AN TE PEN UL TI MATE 

r MN ie Me "ee ee 
LENGTH ERROR 

ce eae Rice: ea ee T 

A 


Since the 1’s dictate where the data in the right 
argument is positioned, there must be exactly as 
many i's in the left argument as there are ele- 
ments in the right argument. More formally: 
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for R+-BV\[K]ARR 
(+/BV)=(pARR)C(K] 
(oR)A.=((K-1)+pARR),(pBV),K+pARR 


If the right argument is a scalar, it is extended to a 
vector of the proper length (+/BY): 


Brie IO 0 TSP 
APL\360 
0 0 360 .360 0 


Expansion is a natural partner to compression. 
Compression “undoes” what expansion does, but 
not vice versa. That is, (except for scalars) A is 
identical to B/B\A. But the expression B\B/A 
fills the result with O’s (or blanks) in positions cor- 
responding to 0’s in B, thus making B\B/A iden- 
tical to BxA for numeric vectors. For matrices we 
can set rows (or columns) to 0 based on a condi- 
tion. 


MAT 
7-2°2 2 2 2-2. 2-3 3 
Si 2-5 21a Ss 8 
Y <2. 4 St 2-8 T. 
: ae Sak Bee Sey ee ee ae E 
Spe Fee 
S93) <4 Sy Ge 1 73 aes 

O+COND<20<+/MAT 

E a + doa? ie | 

COND\COND#MAT 
O. 2 oS. 0) 0.0, E @ 
32 145 oe so eS 
25: She 2 Te TR r 
a DO 50 O40 B29 9 
UOO. 6.0 oo. 6-8 
ea os? oe D'S rs 


The above expression is clearer than the corre- 
sponding multiplication (Shown below) 


MATx&(dpMAT ) pCOND 


and has the advantage of also working with 
character arrays (blanking positions correspond- 
ing to O’sin COND). 

More commonly we wish to perform an opera- 
tion only on selected elements of an array, and fill 
the other elements with a ‘‘fill’” value. For example, 
suppose we are given several employee numbers 
as EN and want to print their names and social 
security numbers. We have available a master vec- 
tor of all employee numbers ENOS, a correspond- 
ing matrix of employee names ENAS , and a vector 
of social security numbers SSEC. 


"TH ,X2,6A41,X2,I9' OFMT (ENOS;ENAS;SSEC) 
1158 PERRY 980304871 
1230 BRENT 876235613 
1272 HOLTON 231879651 
1363 WILKES (128623497 
1514 GOLDEN 123123123 
1778 REGIS 878787878 
1863 PIERCE 222334444 
1881 JOHNS Le2so7 290 


1955 Wiad 547940666 
1978 SMITH 123456789 
EN 


TiS 2270 2272-1555 2660 1/78. ae 


The fundamental algorithm 


I«ENOSiEN 
O<+RPT<EWASLI3;],'G< 999-99-9999>' QFMT SSECLI] 
INDEX ERROR 
O+RPT+EWAS(I31,'G< 999-99-9999>' FMT SSEC([TI]) 
A 


fails if any of the employee numbers in EN is not 


found in ENOS (1270,1555, and 1660 in this 
case). 
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Since we really wish to select and format only 
valid employee numbers, we adjust the algorithm 
to use only those indices that exist, leaving a blank 
line for invalid employee numbers. 


I+ENOS.EN 

E<I<pENOS w I<E/I 

O<RPT<EXENAS[I3;],'G< 999-99-9999>' OFMT SSEC[TI] 
PERRY 980-30-4871 


HOLTON 231-887-9551 


AEGIS ~878-76-7878 
MIX 547-94-0666 


lf we wish, we can further refine the report to in- 
clude a diagnostic message for invalid employee 
numbers by using the negation of £ to identify the 
rows of RPT that were left blank by the expansion. 


R+oNF+(~E)/102F 
ERR+«((R,10)o'NOT FOUND:'),¥(R,1)pENLNF] 
RPTUNF:11+pERRIJ+ERR 
RPT 

PERRY 980-30-4871 

NOT FOUND: 1270 

HOLTON 231-87-9651 

NOT FOUND: 1555 

NOT FOUND: 1660 

REGIS 878-78-7878 

MIX  547-94-0666 


The general form of such exception processing 
for vectors is as follows (assume PROCESS and 
EXCEPT process data to produce the result): 


RESULT*E\PROCESS E/DATA 


and optionally 


N+(~E)/1pE 
RESULT(NJ+EXCEPT DATALN]) 


Expansion is normally viewed as using the left 
argument to modify the right argument. Some- 
times the reverse is true, with the right argument 
being used to replace 1’s in the left argument. 
Consider the following problem: Given the QVR 
form of a function VR, you wish to select certain 
lines LN. 


VY RLN VRSELECT VR;PV;0I0 
fad OLO<«1 x +(pR+ 64VR)+0 
E27 PVs 108 GSS OFCHL,'L*. KRECI jal 
[3] LN+(1+/PV)eLN+1 
C4] PVL[PV/1pPVJ]<LN# 140,LN 
[5] R+(#\PV)/R 


0 3 & VRSELECT DVR '‘VRSELECT* 
V R<+LN VRSELECT VR;PV;0I0 
[3] LN<(1+/PV)eLN+1 
FEJ PV[PV/19PVJ<LNz 140,LN 


A partition vector PV identifies the beginning of 
each line (a bracket preceded by a new-line 
character). The line numbers are transformed into 
a Boolean vector indicating, for each line of the 
function, whether it is to be selected (1) or not (0). 
The tricky code is on line [4]; we modify PY so 
that the first and last+1 element of each contigu- 
ous set of selected lines remains marked with a 1, 
but all other lines become unmarked. Then we use 
not-equal scan (#\) to mark odd parity, which 
selects the proper lines. Using expansion, lineL4 ] 
becomes simply 


C4] PV+PV\LN2 1+0,2N 


thus using its right argument to modify the left! 
The last technique we will discuss is a tradi- 
tional use of expansion to form a rectangular APL 
array from a vector of delimited substrings. Vari- 
ously known by names such as VTOM, SSMAT, 
MIM, ROWNAMES, and BOX, this algorithm uses a 
raveled Boolean histogram to supply the proper 
number of fill elements to expand the vector so 


Line 2 ] determines the indices of the delimiters. 
Line [3 ]computes the lengths of the substrings. 
Line [4] generates a minimal-width histogram, 
using 1's to mean “select another element from 
V”, and 0’s to mean ‘‘fill with blanks (or 0's)”. The 
expansion and reshape (p) on line [5] use that 
histogram to expand the non-delimiting elements 
of V into rows of the resulting matrix. 


w- 


M<VTOM | /BRENT/GOLDEN/HOLTON/JOHNS/MIX' 


that each substring is extended to the same oM 
length. By convention the delimiter is the first 5 6 
M 

element of the vector. me 

VY M<«VTOM V;DEL;LEN;HIST;INDS;0I0 GOLDEN 
ee OLT0<1 « Me 0 0 pV w +(LEN<p,V)+0 HOLTON 
re INDS+( DEL<V=1t+V)/1LEN JOHNS 
[3] LEN<(1+IWNDS ,LEN+1)-INDS+1 MIX 


C4 J HIST+LENo.21[ /LEN 
CS M+(pHIST)p(,HIST)\(~DEL)/V 
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several management positions with Automatic Data 
Processing, Inc. (ADP). At ADP, he was the original 
developer of ADP’s general ledger and financial report- 
ing products. Ira is a native of Baltimore, Maryland; he 
earned a B.A. in accounting from the University of Balti- 
more, where he also attended law school. 

Working for Ira in the development of STSC’s finan- 
cial and accounting applications are the following prod- 
uct managers: Robin D. Mattern, Budgeting and Report- 
ing Products; Sheldon P. Needle, Accounting Products; 
and Brian G. Smolens, Financial Planning Products. In 
the coming months, STSC plans to offer a series of 
financial and accounting applications that are fully inte- 
grated with its existing Financial Planning System (FPS). 

For more information on STSC’s financial products, 
contact Ira Apatoff at the corporate headquarters listed 
on the back of this newsletter. 


STSC Opens Marketing Office 
In Rochester, New York 


On 1 June 1980 a new STSC marketing office was 
opened in Rochester, New York. The new office will 
serve customers in the Rochester and upper New York 
State area. The address and telephone number of the 
new office are 


STSC, Inc. 

3000 Winton Road South 
Townline Park, Building E 
Rochester, New York 14623 
716-442-5281 


William H. Bickford is the manager of the new branch 
office. Bill was formerly manager of the White Plains 
STSC branch office located in Elmsford, New York. 

If you or your company are located in the upper New 
York State area, contact Bill at the above address for 
information on STSC’s APL* PLUS Service. 


New Workspaces Provide 
Foreign-Language Input Functions; 
Plotting on AJ 832 Terminals 


In recent months, several new workspaces have been 
added to STSC’s APL*PLUS Service. Workspaces 
6 SAISIE, 6 EINGABE,and6 ENTRADA provide 
input utility functions in French, German, and Spanish, 
respectively. These functions, which issue error 
messages in the appropriate language, were derived 
from the English-language functions in workspace 


6 INPUT. (See “Workspace 6 INPUT Provides 
Input Utility Functions” in the November/December 
1979 issue of this newsletter.) Each input utility work- 
space contains functions that prompt for and accept in- 
put from the user in a uniform manner. Applications that 
take advantage of these standard input functions are 
not only easier to write and maintain, but also easier to 
use. | 

Another new workspace, 10 AJPLOT , contains 
the function AJPLOT , which produces graphs on an 
Anderson Jacobson 832 terminal equipped with the 
ultraplot feature. 

For more information on these new workspaces, 
contact your APL*PLUS Service representative. 


Rotation 


The symbol for rotate in APL, $, evokes the image 
of a child's toy top, or of a globe with the axis of rotation 
drawn in. It is formed by overstriking the circular o and 
modulus | symbols. Rotation treats a vector as a con- 
tinuous chain of values, which it shifts (cycles) accord- 
ing to some modulus of the left argument. 

Essentially, rotate shifts a vector of values (speci- 
fied in the right argument) a number of places (specified 
in the left argument): 

0<V<'RESPECT' x 20V ® ~16V 
RESPECT 


SPECTRE 
TRESPEC 


As you can see, if the left argument is positive, the 
elements are shifted to the left (beginning to end) and if 
the left argument is negative, the elements are shifted 
to the right (end to beginning). Of course, if the left 
argument is 0, no shifting is performed at all. But what 
of large left arguments? For example, 
125o'EIGHATH' 
HEIGHT 


Does APL actually tediously shift the vector 125 posi- 
tions? 

Consider the question: “What is the position of the 
connecting link in a bicycle chain having 60 links after 
rotating the chain 1250 links?” | hope, rather than rotat- 
ing the chain, counting off 1250 links one by one, you 
would realize that the first 1200 link movements 
are superfluous, and that all you need do is rotate the 
chain 50 links and note the final position of the con- 
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necting link. In APL nomenclature, you would rotate 
60|1250 links. 

Rotate ( VV ) likewise considers its right argument 
a circular chain. If the right argument has p V elements, 
the chain has pV “links”; in the question above we 
considered 60 links. Thus, rotating the chain NV links 
is identical to rotating it (oV)|WN_ links (five in the 
'EIGHTH' example): 


NOV <> ((pV)|N)OV 


This expression maps all left arguments (including 
negative numbers) into a canonical form between 0 and 
(oV)-1. Alternatively, if you prefer rotation to the 
right, you could map left arguments into a canonical 
form between 1-pV and O by 


NOV <> ((-pV)INV)OV 


Thus in the bicycle chain example above, the clever 
(least link movements counted) solution would be to 
rotate the chain backward 10 links rather than forward 
50 links! 

_- 60 “6011250 

50 10 

| should note that APL uses the optimal data movement 
strategy internally, so you need not worry about re- 
ducing the left argument to any minimal form. 

In practice, the (by far) most common left arguments 
to rotate are 1 and 1, used in differencing. Some 
examples follow: 

e CUM represents the running balances in your 
checkbook. Compute the transaction amounts 
(deposits are positive, withdrawals and checks 
are negative): 


TRANS+CUM- 16CUM ® TRANS[1]+CUM[1] 


which can be read as: “Subtract from each ele- 
ment of CUM the previous element.” 


2¥CUM,([0.5] TRANS 
400.00 300.00 250.00 200.05 700.05 
400.00 100.00 ° 50.00 949.95 500.00 


Alternatively, 
TRANS<«CUM- 1+0,CUM 


e Çy is a character vector. Delete all leading, 
trailing, and multiple adjacent blanks: 


CV+' ',CV x NB+CVz! ' 
CLEAN+14(NBV16NB)/CV 


which can be read as: “Select from CV elements 
that are nonblank or that are followed by non- 
blanks.” 

ISTRA t Pe a EP CREAR | 
| DELETE ALL EXTRA BLANKES | 
|DELETE ALL EXTRA BLANKS | 


Alternatively, 


sA SS i Se ee eee: 
CLEAN<1+¥(Bx10B)/CV 


which can be read as: “Select from CV elements 
which are not both blank and followed by a blank.” 
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e Mark the beginning of each series of identical 
elements in the (nonempty) vector V : 


B<+V4 1O6V x B[1]<1 


which can be read as: “Where do elements in V 
differ from the previous element?” 


V * B 

-a ae ee GE ae Ee SE ee Se Se ae 

; Bs « Sees Meme es aa ch c Me | Ste Sect O Sage? Bak | Ee 

e Mark the end of each series of identical elements: 
B+V41OV * B[pB]<1 


which can be read as: ‘‘Where do elements in V 
differ from the following element?” 


V ® B 
o. 3-54-33 SS SS Ss 
f-2- 7S -t ee 8 Se 
As you may have gathered, 1 can be read as “the 
following (or next)” and 1 can be read as “the 
previous (or last)”. 
Another common application of rotate is to align 
matrices. This use derives from the ability of rotate to 
shift many vectors (collected in a matrix or higher- 


dimensional array) at once: 


MAT 
MANGER 
SELVES 
PUTOUT 


30MAT (or 3 3 36M) 
GERMAN 
VESSEL 
OUTPUT 


MATO 
SPRITE 
RECENT 
LAYOUT 
ENLIST 


ESPRIT 
CENTRE 
OUTLAY 
LISTEN 


1-9 3. -_eOMATO 


For example, left justification means rotating a matrix 
to the left until a “significant” element (typically a 
nonblank) is encountered: 


ce ee te” eee S, 
THIS 


| 
| 
| WILL BE 
|LEFT JUSTIFIED 
+/A\CM=' ! 
jie Xe: Ses 
LJ<(+/A\CM=' ')oCM 
a a a 
| THIS | 
|MATRIX | 
| 
| 


Number of leading blanks. 


IWILL BE 
|LEFT JUSTIFIED 


As an exercise, try this problem in alignment and V R«CRTOVR FN;OI0;CR;ALP;LOC;LINES 


justification. (The answer will be given in the next issue.) C1] a <PN> IS THE NAME OF AN EXISTING FUNCTION; 
[2] a <R> IS THE HVER FORM OF THAT FUNCTION. 


C3]  DIO+1 w CR+DCR FN wx +(0epCR)+A 


You are given the LICR (matrix) form of an APL C4]  'FN LOCKED, NONEXISTENT, OR SHADOWED' xw +0 
: : : [5] A:ALP+CRe'ABCDEFGHIJKLMNOPQRSTUVWXYZAABCDEFGHI 
function that Is (of course) less than 100 lines JKLMNOPQRSTUVWXYZA0123456789' 
long. Your job is to convert it to the UVR (vector) [6] a COMPUTE WHICH LINES ARE LABELS QR COMMENTS: 
form, visually identical to the output produced by at wa T ADUN AERE UCAR paas : TE 
A 5 < > < > : 
VFUNCTION(UJV . You may assume that the [9] CR« a YOUR CODE GOES HERE! | 
function contains no new lines embedded in [10] a FORMAT LINE NUMBERS: : 
. ; fii). LINES<!P<[>Q<J]>LI5* OFMT:. 1+1teCcCh 
character constants. The essential problem is to [42] a ADD LINE NUMBERS AND Y DECORATIONS TO <CR>: 
exdent labels and comments one column (or in- [13] CR<(! vf tt Lises Cr Yt) CR At 


[14] a ADD NEW-LINE CHARACTERS, RAVEL, AND REMOVE 
[15] a ALL TRAILING BLANKS: 
fi6)- Het, (ov \ocRe*—*)54)/.08 FOTENE 


A skeleton of the function follows. It is located in v 
workspace 901 WHIZBANG on the APL*PLUS Sys- 
tem. Fill in the code required on lines [7] and [9]. 


CUT HERE | | | | 
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to Develop APL Standard” in the March-June 1980 
issue of this newsletter for more information on the APL 
Standard project recently initiated by ACM/STAPL.) It 
was generaliy agreed at the meeting that a single docu- 
ment was preferred over the two that now exist. The 
question of what should be standardized remains and 
will be discussed at future meetings. 

Also discussed at the Paris meeting was what topics 
should be included in an APL standard. Some of the ac- 
cepted areas are syntax, operators and primitive func- 
tions, definitions of data items, and essential system 
functions and variables. 

A summary of the agreements made at the Paris ISO 
meeting is contained in the 10 July issue of the APL 
Standards Newsletter. Requests for individual copies or 
a newsletter subscription can be sent to: 


Clark Wiedmann 

University Computing Center 
Graduate Research Center 
University of Massachusetts 
Amherst, Massachusetts 01003 
413-545-1500 


According to representatives on the ANSI committee, 
the project to develop an APL standard should take 
about three years. When completed, the standard will 
significantly reduce the need to convert APL programs 
when transferring them between systems and decrease 
the retraining required to familiarize APL programmers 
with new APL implementations. For more information, 
contact Bob Smith, Senior Research Associate at STSC 
and Recording Secretary of the ANSI committee, at 
STSC’s Corporate Headquarters listed on the back of 
this newsletter. 


by Roy A. Sykes, Jr. 


Creative Indexing 


Most people think of indexing as a rather mundane func- 
tion, used only to select elements, rows, and columns of 
arrays. As the June/July 1976 Whizbang pointed out, 
subscription is actually a powerful tool to identify 
elements of an array for selection or replacement. This 
issue discusses some unusual applications of indexing. 


Origin 0 


When []70 is set to 0, indexing can be used directly 
with subscripts of 0 — Boolean values, for example. The 
statement 


>+(£21,22)(M>C] 


branches to 12 if M is greater than C , and to L1 other- 
wise. Likewise, the statement 


+(L0,L1,2N)[2lLpV) 


branches to LO if V is an empty vector, to L1 if V 
has one element, and to LN otherwise. 


Few Elements, Many Indices 


Notice how the following function #IST subscripts a 
four-element vector with a much larger matrix of indices. 
Origin 0 is again used to avoid the annoying subtraction 
of 1 from YEAR and the addition of 1 to the indices. 


V R+YEAR HIST REVEXP;M;0I0 


OT0<+0 

Re'7T4U,X1' OFMT YEAR+114o0REVEXP 
R«R,' *0@'[ 24REVEXPo.>iM+[/,REVEXP ] 
M+ 44+ 1M+5 

R<R,COJQ'BLI2' OFMT Mx0=5|M 


1 A PRODUCES A PROFIT (LOSS) HISTOGRAM. 

2 A <YEAR> IS THE BEGINNING YEAR. 

3 A <REVEXP> IS A 2-ROW MATRIX: FIRST IS REVENUE; 
4 A SECOND IS EXPENSE. THE VALUES MUST BE ONE- OR 
5 A TWO-DIGIT POSITIVE INTEGERS. 

6 

7 

8 


IO 
UJ 
(am A PA G A E AS ey S) PL eS te) 


rar 28 Eos tee Titer N ni 


Ha 
ti 


< 


REV ® EXP 
FAD LEa 
6 8 12 14 18 


i974 HIST REVO .5 JEEZ 

1974 @@e@ 
1975 @@@@ex 
1976 @®®@@®@@e80 
1977 @®@®@@®@8®®800 
1978 @@®@@@8O88e8800 
1979 @®@@SSSSR2SSOSRROOO 
1980 @®@®S@SSSSRSSSRHOBRRBOOO 

5 Ai 1 2 

O 5 6) 


Slicing Transpose to Refine the Result 


As the original article on subscription suggested, one 
way to select scattered elements from an array is to re- 
fine the result of indexing by using dyadic transpose ( & ) 
to select a diagonal. 

In the function that follows we use subscription first 
to select the digit indices (line [5 } ), then to select the 
proper order-of-magnitude character set (line [6 ] ). By 
this time we have too much data, so we select the major 
diagonal of each plane and transpose it (line [ 7 | ). Note 
that 1 O OF is identical to Q0 1 1QR. Finally we 
ravel and remove excess blanks. 


vV R<ROMAN A;QIO 
CiJ] a FORMAT AS A ROMAN NUMERAL ONE 
[2] A INTEGER IN THE RANGE 0-3999. 
[3] OI0<0 
ES Reo FO" 20 22224 are r 25 
fea BOO ME? RAE ee these 4 
[6] Het AG 4S Boot WOT Pe EIER 
es R+, 10 0 QR m Re(Ret ')/R 


ROMAN 1444 
MCDXLIV 

ROMAN 1666 
MDCLXVI 

ROMAN 2507 
MMDVII 

ROMAN 3888 
MMMDCCCLXXXVIII 


This function is particularly instructive, not only be- 
cause of its powerful use of subscription and transpose, 
but also because of its uses of encode( tT )online [5]. 
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Hand computing the operation is a good exercise to un- 
derstand the resultof 0 4 2 27TR. Observe that origin 
0 iS again more appropriate, this time because we're in- 
dexing by the digits of various numbering systems, all of 
which start with 0. 


Precomputed Translation 


There are several special circumstances where we can 
use subscription on a precomputed array to reduce 
processing time significantly. For example, if V isalong 
vector of integers in the range 1 to 15, it may be faster 
to perform (®115)[V] than to calculate eV. Rather 
than computing the natural logarithm of every element of 
V, we simply compute the 15 possible result values 
and select the appropriate elements. Subscription is 
used to translate from one range of values to another. 
The technique can be modified to work with noninteger 
values by adding a “lookup” vector of the unique input 
values: 


PRECOMPUTEDAOUTPUTS [ POSSIBLEAINPUTS 1 INPUT ] 


Removing Duplicate Values 


Suppose you have several thousand account numbers 
ACCTS (integers in the range 1001-9999) from which 
you wish to remove duplicates. The traditional and most 
general technique, 


UNIQ+((ACCTSiACCTS )=1pACCTS )/ACCTS 


is far too expensive (approximately $6.00). A refined 
technique 


SORTED<ACCTSLAACCTS | 
UNIQ+( 141,SORTED410SORTED )/SORTED 


is much less expensive (approximately $0.40), but still 
takes no advantage of an important characteristic of 
ACCTS, namely that it is integer with a known and 
relatively small range of values. 

The ‘“‘Whizbang technique” is to create an incidence 
vector, mark the elements corresponding to the given 
account numbers, then note the positions of the marked 
elements: 

MAX+9999 
IV+MAXp0 ® IVLACCTS]«1 * UNIQ*«IV/1MAX 

Observe that a given element of 7 V may be marked 
(set to 1 ) several times; of course re-marking an ele- 
ment that is already marked has no further effect. The 
cost: about ten cents! 


Fast Epsilon 


In certain cases of membership, we can profitably use 
an incidence vector as a precomputed translation if we 
know that the data has a limited range. For example, 
assume A and B are both large vectors of account 
numbers like those referenced in the preceding section 
(“Removing Duplicate Values’). A represents all ac- 
counts, and B represents accounts opened before 
1979. You wish to flag those accounts opened before 
1979. While AcB is terse, the following statements 
can be considerably faster. 
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MAX<9999 
IV+MAXp0 ® IVLBJ<1 ® FLAG+IV(A} 


We again create an incidence vector and mark the ele- 
ments corresponding to accounts opened before 
1979. We then use that incidence vector as a trans- 
lation vector for the accounts in 4. 

A related technique is described in DETAIL forthe 
variable BITDATES in workspace 6 DATES on the 
APL*PLUS System. BITDATFS is a 123199-ele- 
ment Boolean incidence vector that has been pre- 
marked with all valid dates from January 1, 1900, to 
December 31, 1999. Thus, subscripting BI] TDATES 
by dates in the form MMDDY/Y provides an extremely 
fast verification technique (possibly in conjunction 
with Exception Handling). You could further modify 
BITDATES to flag weekdays, holidays, anniversaries, 
fiscal periods, or whatever you can precompute. 


inverting a Permutation 


lf PV is a permutation vector, then 7?<APV computes 
its inverse permutation, as does JP<PViipPV. Con- 
siderably faster than both algorithms is the following: 


EPer E FF i ry ISIP 


Thus, to rank a numeric vector V (using the permu- 
tation AV ), the expression 


R+ıpV ® RLAVI<R 


is almost twice as fast as R<AAYV, and is usually many 
times faster than R<«( AV) i107. 

Often it is convenient to process data in ascending 
order, but you need the result in the same order as the 
original data. The initial processing is something like: 


DATA<DATA[GV+ADATA ] 
RESULT+PROCESS DATA 


To get the result back into the original order (that is, 
RESULT+RESULT[ AGV ]), you can take advantage of 
the inverse permutation and instead perform 


RESULTULGVIJ+RESULT 
For example, 


VY R+DENSERANK V;P 
C1] a PRODUCES A DENSE RANK OF NUMERIC VECTOR <V> 
[2] a (ABA (O020«1 * 1.DRAWK V) IN WORKSPACE '6 SORT'* 
C3] a ON THE APL*PLUS SYSTEM). 
C4] V+VELP+AV] a ARRANGE <V> IN ASCENDING ORDER. 
[5] a MARK FIRST OCCURRENCE OF EACH DISTINCT ELEMENT. 
[6] V+ 141, V217 
[7] a COMPUTE IN WHICH PARTITION EACH ELEMENT OCCURS. 
[38] R<+\V 
[9] a REARRANGE IN ORIGINAL ORDER (SAME AS R+R[AP]). 
[10] REPJ<R 

y 


GOLFSCORES,([0.5] DENSERANK GOLFSCORES 
76--S6: S58 70-72-77 D7- 73 67 71 67 78. 87-73-71 
$a Ree O See 2 So eS 


Miscellaneous Techniques with Elided Subscripts 


How can we assure an argument is a vector, forcing 
failure if it is not (see the section “Implicit Error Check- 
ing” in the July/August 1979 Whizbang)? Try 4<AL J, 
which uses very little time or storage (no copy of A is 


made). Use A4+(1/A){]to also allow (and ravel) OCR (matrix) form of an APL function. The code you 


scalars. To assure a matrix, use A['';'' |. (Unfor- supplied should resemble that on lines | 7 | and L 9 ! of 
tunately, the current implementation of A[; | busily the following function (available on the APL*PLUS 
indexes each and every element of 4.) System in workspace 901 WHIZBANG): 
Suppose 4 is a numeric array and you want to as- See ee ee LS 
sure it is stored as compactly as possible by APL with- [1] a <FW> IS THE NAME OF AN EXISTING FUNCTION; 
in recision. [2] a <R> IS THE OVR FORM OF THAT FUNCTION. 
DEE ROn ee [3] l0+1 s DRRR FE +(0epCR)+VA 
i RE 'FN LOCKED, NONEXISTENT, OR SHADOWED- ® + 
B+(pV+,A4)p0 = BL I<V x A+(pA)pB [5] A:ALP+CRe 'ABCDEFGHIIKLMNOPQRSTUVWXYZAABCDEFGHI 
: JKLMNOPQRSTUVWXYZA0123456789' 
The above statements demote the internal form to the fe]. a COMPUTE WHICH LINES ARE LABELS OP COMMENTS: 
minimum required to represent the data in A. [7] LOC+(CRU31Je'at)v((+/A\ALP)OCR)C31J=":! 
EE n: USE. <L00> TO. -20DENT.-<CH> APPROPRIATELY: 
; [9] : CRELOCO T- ER 
Rotation Problem — Results [10] a FORMAT LINE NUMBERS: = 
P41] LINESe'P<[2Q<JoZI5' UFMT: 14+1teCR 
a 2] a ADD LINE NUMBERS AND V DECORATIONS TO <CR>: 
The Whizbang on rotation (  ) in the preceding issue of ee er ee 
this newsletter (March-June 1980) ended with an exer- [14] a ADD NEW-LINE CHARACTERS, RAVEL, AND REMOVE 
cise. You were to supply the code to compute and ex- Sos Poke en AEE RAE 
dent lines beginning with a label or comment in the y 
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by Roy A. Sykes, Jr. 


Piotoispoiia Branching 


In the January/February 1977 issue of this newsletter, 
we discussed techniques for branching and iteration. 
The fastest method used a precalculated vector to con- 
trol branching in a simple iterative program (ITERATE). 
Several readers questioned the general applicability of 
such a method, or suggested it was an obscure tech- 
nique. This article will apply the same principle to 
several more complicated problems. We will show how 
the technique can actually simplify and localize the 
often complex and distributed logic of branching pro- 
grams. All examples are in origin 1 ( DI0~+1 ). 


Label Vectors 


The simplest application of precomputed branches is 
the label vector, which contains branch targets (line 
numbers) for various options. For instance, the function 
CH in workspace 78 CH is similar to the “change” 
option in the APL* PLUS Message Processing System, 
MAILBOX (workspace 666 BOX ). Itlets the user mod- 
ify a text string by using control characters like “/” 
(delete), ‘‘.” (insert), “rR” (replace), “v” (view), and 
“F” (move window forward). Here’s what such a pro- 
gram might look like: 


V CHISE FIRST 
EJ setup code 
[2] INP: display window > F+1 
[3] FIRST+(<\Ez#' ')/E 
+(0=pFIRST)p0 


J 
[5] DEL:+(FIRST#'/')pINS © deletion code © +INP 
[6] INS:+(FIRSTz'.')pREP © insertion code Ò +INP 
[7] REP:+(FIRST2#'R')pVUE © replacement code Ò +INP 
[8] VUE:+(FIRSTz2'V")pFWD © view code © +INP 
[9] FWD:+(FIRST2'F')pERR © forward code Ò +INP 


C10] ERR:'"TRY AGAIN" © INP 


After setup on line [1], we display a window of 
text and accept input on line [2] and select the first 
nonblank character on line [3]. If the entry is all blank, 
the function stops. Otherwise, we check the character 
serially against each of the options (lines [5] through 
[9]). If it doesn’t match an option, we move on. If it 
does match, we perform the indicated action and return 
to the input phase. If the control character matches no 
options at all, we report an error and reprompt. 

The difficulty with CH1 is that the control logic is 
smeared all across the program. Furthermore, the line- 
by-line checking for control characters is as tedious to 
APL as it may be to the reader. A better approach is to 
gather all the decision and control logic in one place as 
illustrated in CH2. On line [3] we select the first nonblank 
character, determine its position in the list of valid con- 
trol codes, and branch directly either to the appropriate 
section of code, or to the error message. If the entry is 
empty or all blank, we end execution. 


V CH23E 
[1] ong code 


S > Rew, 

[5] DEL: deletion code © +INP 
[6] INS: insertion code © +INP 
[7] REP: replacement code © >INP 
[8] VUE: view code © +INP 

[9] FWD: forward code Ò +INP 
[10] ERR:'TRY AGAIN' © +INP 


Notice how the structure of the program, though 
fundamentally unchanged, is clearer. Each line is de- 
voted to one operation. The reader need only refer to 
line [3] to determine all possible options. Thus, sub- 
sequent program changes are simplified. One further 
improvement can be made: incorporate the options and 
their branch targets into setup processing. 


V CH;E;OPTIONS;LABELS 
[1] setup code © OPTIONS~+'/.RVF' 
[2] LABELS+DEL,INS,REP,VUE,FWD,ERR 


[3] INP: display window Ò E+ 
Cul +LABELSCOPTIONS1(<\Ez' ')/E] O +0 


This further clarifies the essential decision logic on line 
[4] and eliminates the cost of catenating the labels 
together at every user entry. 


Decision Tables 


Occasionally one may use a matrix of labels when ef- 
fecting state changes, or when controlling user interac- 
tion (as in computer-aided instruction). Consider the 
following example: In STSC’s Deferred Execution Sys- 
tem, the user is allowed to change the state of a De- 
ferred Execution request only in certain ways. Further- 
more, the action taken by the system differs greatly 
depending on the existing state and the requested state. 

Figure 1 — reproduced with modifications from the 
Deferred Execution Reference Manual (STSC, 1979), 
page 7 — shows the allowable state transitions for the 
user. The program that actually effects these changes, 
DFSTATE, must decide whether a transition is legal and 
take the appropriate action. 

We can view the transition diagram in Figure 1 as a 
decision table (see Table 1). Each cell in the table de- 
scribes what should be done if a user attempts a tran- 
sition from one state (a row) to another state (a column). 

By substituting labels for the cells of the decision 
table, we can simultaneously check the desired state 
change and transfer control to that portion of the pro- 
gram which performs the change. We can further modify 
the label matrix to handle nonexistent requests (add a 
sixth row branching to NOTFNWD), and invalid request 
states (add a seventh column branching to INVALD). 
The decision table thus would contain values of labels as 
shown in Table 2. 

The function STATEA that follows uses this tech- 
nique to precompute all branch targets. Located in work- 
space 901 WHIZBANG on the APL*PLUS System, 


it can be copied into 935 DEFEX for execution. 

VY NEW STATEA REQS;C;DT;CUR 
CHANGES THE STATE OF MANY <REQS> TO <NEW> STATES. 
pREQS +> pNEW © A/NEWe1 2 4 5 6 


A 
A 
LJ- -A 
A GET CURRENT STATES (6=DELETED OR NOT FOUND), 
f A AND MAP INVALID <NEW> STATES TO 7: 
[6] CUR+1+6| 1+|0 DFSTATE REQS © NEW+(16).iNEW 
[7] ^a CREATE DECISION TABLE: 
[8] DT+((5 6 PILLEGL), CL] NOTFND) INVALD 
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Boe 


Release 


Current 
State 


[13] Held REL 


[23] Pending ILLEGL 


Eas 
C4; ] 


Running 


Termi- 


ILLEGL ILLEGL 


ILLEGL ILLEGL 


Table 2 — Labeled Decision Table 


Action to Be Taken 


£33] 


Run 


ILLEGL 


ILLEGL 


ILLEGL 


ILLEGL 


C34] 


Terminate 


ILLEGL 


ILLEGL 


BNC 
ILLEGL 


(333) 
Abort 


[ 744 


Delete 


Lora 


- Invalid 


INVALD 


INVALD 


INVALD 


INVALD 


nated 


[53] Aborted 


Figure 1 — Transition Diagram 


Table 1 — Decision Tabie 


Desired State 


(1) (2) (3) (4) (5) 
Current TERMI- 


State 


Hold 
Request 


Release Abort 


Request 


(1) 
HELD 


Illegal illegal 


(2) 
PENDING 


Hold 
Request 


lliegal illegal illegal Abort 


(3) iliegal 
RUNNING 


Bounce Abort 


Task 


illegal illegal 


(4) 
TERMINATED 


illegal Illegal Illegal Illegal Illegal 


(5) illegal 
ABORTED 


illegal Illegal illegal illegal 


[9] DTC1 2 31]*+HLD O DT[1;2]+REL © DT(3;4)+BNC 
(10}.. DT{1.2°-3 ;5]+4Br O DTL1.2 4 S&S 3631+DEL 
[11] a SELECT FROM DECISION TABLE AND BEGIN LOOP: 
[12] C+1 Q +DT<(1 1 QDT(CUR;NEW]),END 
[13] HLD:HOLD REQS([C] O- +DT[C+C+1] 
[14] REL:RELEASE REQS(C] © +DT(C+C+1] 
C15] BNC:CUR+1 DFHIST REQS(C] © CUR+CUR(CUR[;1]133;3] 
[16] CUR+ODL S5xOBOUNCE CUR © STATE REQS([C] O >DT([C+C+1] 
[17] ABT:ABORT REQS([C] O +DT(C+C+1] 
[18] DEL:DELETE REQS([C] Q +DT[C+C+1] 
[19] ILLEGL:'ILLEGAL CHANGE TO REQUEST ‘,*REQS[(C] 
[20] =D CEGE] 
[21] NOTFND:'NOT FOUND: REQUEST ',*REQS[C] 0 +DT([C+C+1] 
[22] INVALD:"INVALID STATE FOR REQUEST ',*REQS(C] 
(23) +bpTlC+c+1) 
[24] END:(¥C-1)," REQUEST',(C=2)+'S PROCESSED" 
y 


Note that, as in cH, the decision logic is localized 
(lines [6] through [12] ), and the iterative “working” 
portion (lines [13] through [20] ) is clarified. 


Parallel Decision Making 


APL is primarily a parallel-processing language. How- 
ever, as when dealing with files, one occasionally must 
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ILLEGL ILLEGL ILLEGL 


C6; ] NotFound| VOTFND NOTFND NOTFND 


HELD PENDING RUNNING NATED ABORTED DELETED o 


Request 


Request 


Request 


ILLEGL ILLEGL -DEL INVALD 


INVALD 


write iterative code. Unfortunately, many users 
write loops in APL without considering ways to 
reduce the relatively high cost of repeated syn- 
tax analysis. Among the techniques: 
(6) e Don’t regenerate values. Calculate and 
store them in advance. 
Use all available information to precom- 
pute values. 

e Parallel-process whenever possible. 

The main goal is to remove as much compu- 
tation and logic from the loop as possible. Good 
setup breeds “tight” loops. Consider the fol- 
lowing function that, given an input file specified 
by I, reblocks it to an output file specified by 
0. All components are vectors. 

V-O- REBLE I3;7;CN;IO;RD;SZ;WR 


Delete 
Request 


Delete 
Request 


illegal 


Delete C1] RD+I(13;1] © CweitI[13;] O S2«14I[2;] 
[2] WR+O[1] © I0+1+0 

Request [3] 0+10 © I+d+1 A <SZ> CAN BE ERASED 
[4] INP:+(I>pCN)p0 © 0+0,0FREAD RD,CN[TI) 
[5] CHK:(J>pI0)p0 © +((p0)2I0[J]) poOuT 

Delete re] T+I+1 © +INP 

Request [7] OUT:(IO[J]p0) OFAPPEND WR  0+I0[J]+0 
[8] d+J+1  +CHK 


Vv 


Lines [1] and [2] decompose the arguments into 
five distinct variables: 


Tie number of input file 

Component numbers of input file 
Number of elements in each input com- 
ponent 

Tie number of output file 

Number of elements to each output 
component 


RD+ILi:;:1] 
CN+1+I[1;] 
SZ+1+I[2;] 


WR+0(1] 
T0<«14+0 


The total number of input elements (+/1+Z[£2; 1) must 
be equal to the total number of output elements (+ /1+0). 
Line [3] initializes the input and output component 
counters ( I and J ) and the output buffer 0. Lines 
[4] through [8] constitute the loop. 

Although REBLK is fairly clear, observe how the logic 
(three conditional branches, two unconditional branches, 


and two counter incrementers) tends to obscure the es- 
sential tasks of the loop (read to buffer, write from buffer, 
drop from buffer). Furthermore, although cw and I0 
never change, the values pCN and pI0 are recomput- 
ed at each iteration. Lastly, the function takes no ad- 
vantage of the known sizes of the input components 
FFARR 

There is a better way! The function REBLOCK that 
follows produces an extremely tight loop (lines [6] and 
[7]) by taking full advantage of the arguments. 


V O REBLOCK I;CN;IO;LN;RD;SZ;WR 

C1] RD+IC13;1] © CW+14+I[1;] O S2Z+14+I[2;] 

[2] WR+0[1] © I0+1+0 

[3] 0-(pIO)24(+\I0),+\ 140,82 © I*+(~0)/1p0 

C4] LN+O\OUT © LN[IJ+IWNP  I0+0O\IO O LOETISCEN 

sj 0+10 © I+1 © >LN+-LN,0 A <SZ> AND <CN> CAN BE ERASED 
[6] INP:0+0,0FREAD RD,IOCI] © +LN[I+I+1] 

C7] OUT:(IOCI]p0) OFAPPEND WR © O+IO[I]+0 © +LN[I+I+1] 


y 
No unnecessary calculations are performed in the loop; 


compute all branch points, rather than serially testing 
scalar values at each iteration. 

Note that zw (set on line [4+] ) incorporates not 
only the read-or-write decisions, but also the end-of- 
loop test (added on line [5] ). IO contains both the 
input component numbers (where ZN is equalto INP ) 
and the output block sizes (where LW is equal to OUT ). 
Thus only one counter ( 7 ) is needed, and the program 
flow is greatly simplified. 


Conclusion 


We've seen that complex branching programs can be 
simplified whenever one knows, in advance, the deci- 
sions to be made. The branching logic is concentrated 
in one place rather than being dispersed throughout the 
program. Decisions are made in parallel rather than 
iteratively. And the essence of the program’s function 
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CMCS” Reference Booklet 


The APL* PLUS Comprehensive Manufacturing Control 
System (CMCS) Reference Booklet was recently 
revised. The pocket-size, 16-page booklet gives the 
locations of all CMCS utility functions, describes the 
parameters used in retrieving information from files, and 
summarizes CMCS capabilities. (N/C) 


by Roy A. Sykes, Jr. 


Advanced Formatting 


Editors’ Note: In the January/February 1981 issue, we in- 
advertently omitted Roy Sykes’ name as the author of the 
Whizbang on “Formatting”. Our apologies to Mr. Sykes. 


In the January/February 1981 issue of this newsletter 
we reviewed the capabilities of JFMT, the formatting 
primitive available on STSC’s APL* PLUS System. In 
this issue we'll discuss some advanced techniques using 
UFMT, all of which derive their effects from terminal con- 
trol characters embedded in the left argument format 
string. The terminal control characters we'll use are 


e [ITCBS — backspace 
e [JTCDEL — delete 


Using these cursor-positioning characters, we can 
make [FMT position data in unusual ways, achieving 
snappy results at moderate cost. These techniques can 
also be applied (often in conjunction with DJARBOUT )to 
controlling full-screen video display terminals and other 
devices. 


e (|TCLF — linefeed 
e ITCNL — new line 


The Title Format 


One common technique is used when formatting a multi- 
page report. Of the multiple lines of header and title in- 
formation, usually only the page number and perhaps 
one or two other items change. Rather than reconstruct- 
ing the header and title at each page, experienced pro- 
grammers expend some extra effort in setting up a single 
all-encompassing format phrase: 

V TF*+SETUP HALFYRPAGES;HALF;YR;PAGES;MOS 
[1] a CREATES TITLE FORMAT STRING. ARGUMENT IS A 3-ELEMENT VECTOR. 
[2] A REQUIRES (DIO+1) AWD <DATESPELL> FROM WORKSPACE '6 DATES'., 
[3] HALF+HALFYRPAGES[1] © YR+-HALFYRPAGES[2] © PAGES+HALFYRPAGES([3] 
C4] MOS* 12 4 p' JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC'! 


(5] TF+'REXNOX DUPLICATING AND PETROL CO.' © TF+(L( 60-p2F)+#2)+2F 
[6] TF+'(',TP,OTCNL a START FORMAT STRING, CENTERED IN WIDTH 60. 


LT] TP+TP,30+(5 “6CHALFJ+'FPIRSTSECOND'),' HALF ',(¥YR),' FORECAST! 
[8] TF+TF, 3047 DATESPELL 3p0TS 
[9] TF+TF,OTCWL,'REGION N tI23.,POUPAGE DON (OF ',(¥PAGES),')0I30,0' 


[10] TP+TP,OTCLF,OTCNL,'TERRITORY',,(6  6[HALF], 7)+MOS 
[11] TF+rF,' rorab*,Urest, TATT ',(49p! 


pTF*+SETUP 1 1981 75 (First half of 1981; 75 pages total.) 


' FIRST HALF 1981 FORECAST 


TF 
m REXNOX DUPLICATING AND PETROL CO. 
MONDAY, JANUARY 19, 1981 
REGION M,LI23,PMPAGE MQM (OF 75)MI30,0 


TERRITORY JAN FEB MAR APR MAY JUN 


Note that we’re not constructing the header itself, but 
rather an enormous format phrase that will create the 
header when used with {]?’'/“7. By collecting all the title 
information in one variable ( TF) rather than many, we 
clarify the main program. At each page, we need only 
execute the following statement to display the header 
and column titles: 


REGION<4400 © PAGE+32 © TF OFMT (REGION; PAGE) 
REXNOX DUPLICATING AND PETROL CO. 
FIRST HALF 1981 FORECAST MONDAY, JANUARY 19, 1981 
REGION 4400 PAGE 32 (OF 75) 


Underlining Numbers 


Examine closely the following format phrase and its effect: 


ES 
K2M< (>N<)>@< >R< >60222.222.229.990 
NUMS+ ~12345678.9 123.45 0 43.21.75319.86 987654321 


FS OFMT NUMS 
(12.345.678.90) 
£323.45) 
0.00 
43.24 
732319.86 
987.654.321.900 


Does either strike you as unusual? On most APL systems 
we can’t underscore punctuation or digits! 

It's clear that we have some strange characters in FS. 
Let's try displaying and applying a function that reveals 
control characters like backspaces ( #) and deletes ( Ï). 


V  ReRCC 42:83¢ 
J] a REVEALS CONTROL CHARACTERS BY SUBSTITUTING 
2] a VISIBLE SURROGATES IN A CHARACTER VECTOR. 
C+OTCBS ,OTCDEL ,OTCWL ,OTCLF ,OTCNUL ,OTCBEL 
4 ] BeAeC © R+B/ipB © ALR]+-'BDNLIR'[C:ALR]] 
J Ret 8472" 1 2°07." © Ufrees tore. sis 


pFS 
88 

4O+FS © RCC 4Ot+FS 
K2M<(>N<)>@< >R< > 
K2M<(b_>N<)BBBBB_ >> >Q@<BBBB 

4O+FS © RCC HOES 

2422.,229.990 
2B _28_,B_2B_2B_2B_,B_2b_zB_9 .99bb ddd DDO 


>R<DD > 


GUZ 


22 
GMZB_ 

We see that what appears to be a single character 
(like Z) is actually the composite of a Z (a control charac- 
ter in G format), a backspace ( UTCBS ), and an under- 
score ( _ ). The background fill 


is crucial; without it any fields printed to the right would 
be misaligned because of the blanks with which G format 
replaces the (very long) pattern. Notice what happens if 
we overflow the visible field width (15 in this case): 


(FS OFMT NUMSx10),'0' 


BOK KK TOK KOK KOK IO RR ek [] 


753,198.60 D 


Ok kk kkk kok kok kok ok kok kok ok kok ok kok kok ok kkk kk kk kk kK eT] 


pFS UFMT NUMS*x10 
6 45 
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All positions are filled with stars, and the physical field 
width of 45 becomes apparent. The delete characters 
are used to fill positions that would otherwise be spaces, 
thus allowing FS to be used with other format phrases. 

The function ULFMT (available in workspace 
901 WHIZBANG on the APL*PLUS System) illus- 
trates the basic technique for underscoring numbers 
with FMT. 

V R+ULFMT -Ar KEW 

Ci] a FROM A 1- OR 2-ELEMENT INTEGER VECTOR a{,w} 
[2] a REPRESENTING QFMT PHRASES ‘'Ia' AND 'Fa.w', 
Pao a GENERATES AN EQUIVALENT 'GMO' FORMAT PHRASE 
L4 J a WHICH UNDERLINES NON-BLANKS IN THE RESULT. 
oa R+2+K+1+W+2pA, 1 © W+ipW © W+(3xW-R)p'ZB_' 
[6] A .W,(Rp'9.',Rp'9'),(, S EY a E AN 
[7] R«((K>0)/'K',¥K),'M< B_>R<',(2p0TCDEL),' >',R 


C8] RCUCR='B')/ipkI]<OTCBS 
y 


The essential elements are the G picture phrase (con- 
structed on line [6 ]), and the K scaling and R back- 
ground (line [ 7 ] ). Let’s try revealing the control char- 
acters of a resultant format string, and using it to for- 
mat NUMS. 


£OC ULFULFMT- 12 
oM<"B_>R<DB >Gz8_28_2B_2B_28_28_2B_23_o. 99BBBB____o 
THT me oe ne mn “ae ri Nii au 
Ai L 
-QM<">R< >GMZZZZZZZZ9.929Ù 
UMS<«NUMS ,1E7 
Se ee? tik? F12.21)- BF -( 175 NUMS; FUNS) 
1 712345678.90 ~12345678.90 
“457 ut 123.4 
i f 0 
43.21 43.21 
15319.86 75319.86 
€ 987654321 Q 987654321 


* Ic 


fi KKK KKK KKK KKK KKK KKK KKK KKK KKK KEK KK KEKE KEK KKK KKK KKK KKK 


Variable-Length Formatting 


Judicious use of qualifiers and decorators can produce 
results difficult to achieve otherwise. [JF /T can distin- 
guish between four classes of numbers: 
e Positive numbers, decorated by P or Q quali- 
fiers. 
e Negative numbers, decoratedby M, W,or 5S qual- 
ifiers. 
e Zero, decorated by B, P, Q, R,or Z quali- 
fiers. 
e Large numbers which overflow, decorated by the 
S qualifier. 


We can combine these capabilities with control charac- 
ters, notably the delete character ( OTCDEL ), to obtain 
unusual effects. For example, the phrase below blanks 
all negative numbers by inserting a negative decorator 
too wide (eight characters) for the specified field width 
(six). Hence negative numbers overflow, causing the 
field to be filled with stars (*), for which blanks ultimately 
are substituted. 


MT 
255 oe Tia 999 0 34 -41592 
21 48201 8 “28 4590 “450 a 
'S<x >M<OVERFLOW>I6' (FMT MT 
oo 113 0 a2 S23592 
27- 18281 8 4590 


Alternatively, we can substitute a delete character for 
the overflow indicator to remove all visible traces of the 
numbers. 
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('S<*' DTCDEL,'>M<OVERFLOW>I6') OFMT MT 
Zag 113 0 31 41592 
2? 26261 8 4590 


Finally, we can also use a delete for the background 
(making sure to leave a space between nonnegative 
numbers) to achieve what appears to be selection and 
variable-width formatting of nonnegative integers: 


FP<'S<x!,OTCDEL,'>M<OVERFLOW>R<',OTCDEL,'>Q< >I6' 
FP (FMT MT 

255-2235: 0-32-4259 2 

22, A8201 F-45350 


Multiple -Line Formatting 


The final example combines most of the features 
described in this and the previous article to achieve multi- 
line, variable-length formatting in a typical accounting 
application. 

The matrix CHECKS contains the following informa- 
tion as nonnegative integers: 


CHECKS[ ;1 ] — check number (up to six digits) 

CHECKS[ ;2 ] — payee number (up to five digits) 

CHECKS{:;3]— check amount (in cents, up to 
nine digits) 

CHECKS[ ; 4 ] — date (YYMMDD) 

CHECKS[ ;5 ] — reference number (up to nine 
digits; zero if none) 


Each row of DISTR, an integer matrix of arbitrary width, 
is associated with arow of CHECKS. DISTRcontains 
N (N>1) pairs of columns, each pair containing the distri- 
bution account number ( DISTR[ ; 1+2x1W)) andthe 
distribution amount ( DISTRL ;2x1N J ), in cents. Un- 
used pairs contain zeros. Total distribution amounts 
equal check amounts. Also associated with each row of 
CHECKS isarowof DESCR, a 50-column matrix. 


CHECKS 


54345 282 1000000 801031 147972 
29394 1243 30710 800703 0 
17676 747 246826 810116 757842 
54900 9999 120000000 801223 54001224 
((2x1+pDISTR)p0)¥DISTR 
423 50000 424 600000 456 350000 0 0 
424 30710 0 0 0 0 0 0 
456 163510 458 45263 662 1429 798 36624 
5410 12000000 5420 108000000 0 0 0 0 
DESCR 


20°/° DISCOUNT FOR REXNOX DUPLICATING a PETROL CO. 
SEE CHECK 29266 FOR BALANCE OF PAYMENT 
100/0 DISCOUNT ON WIDGETS; 7°/° DISCOUNT ON GIZMOS 
PAYOFF OF 12/24 NOTE TO SHARK LOAN CO. 


Given the data listed, we wish to produce the following 
report: 


HEADINGS © (MLF QFMT CHECKS ,DISTR-DISTR=0) ,DESCR 


CHECK DISTRIBUTIONS _ 
NUMBER DATE _ PAYEE CHECK AMOUNT ACCT _AMOUNT(S) REFERENCE 
54345 10/31/80 282 10,000.00 423 500.00 147972 
424 6000.00 
456 3500.00 
200/0 DISCOUNT FOR REXNOX DUPLICATING a PETROL CO. 
29394 07/03/80 1243 307.10 424 307.10 
SEE CHECK 29266 FOR BALANCE OF PAYMENT 
17676 01/16/81 747 2,468.26 456 1635.10 757842 
458 452.63 
662 14.29 
798 366.24 


10°/o DISCOUNT ON WIDGETS; 70/0 DISCOUNT ON GIZMOS 
54900 12/23/80 9999 1,200,000.00 5410 120000.00 54001224 
5420 1080000.00 
PAYOFF OF 12/24 NOTE TO SHARK LOAN CO. 


The APL expression certainly looks easy enough. 
Actually almost all of the “programming” in this example 
is devoted to constructing the rather elaborate 175- 
character format string. We'll display the format phrases 
grouped by the columns of data they format: 
76,731,6M 2,222,2293.99[, (check number, payee, amount) 


T14,CGMN99BBBBBBBB99/939/M, (date) 
BI4u7,752,724,GM 2222229.99f], (reference, first distribution account, amount) 


T77,MÄÑ ™, (new line) 
99(S<*D>N<OVERFLOW REPLACES *S BY DELETES>I32 (distributions) 
,S<*D>M<OVERFLOW>GO) 2222229.99N m) 


Several points illustrated are worth reviewing. 

1. Don’t format character data. 

2. Use G phrases to clarify and reduce the number 
of format phrases and qualifiers. Observe that 
the phrase GM Z,ZZZ,ZZ9.99M positions 
( X1, ), scales ( K2 ), decorates ( C ), and formats 
(F12. 2 )the check amount. 


$: 


8. 


Use field width (like 731) to position data rather 
than positioning phrases ( X26,75 or733,J5). 
Use positioning phrases (like 714) to avoid re- 
arranging the data. 

Use new-line (4) and linefeed (Ë) characters to 
display one row of data on several print lines. 
Use backspace (Ë) characters in a G phrase to 
decorate and even rearrange digits. Notice that 
the date was formatted as MM/DD/YY in the report, 
but represented as YYMMDD in CHECKS 34 J. 
You can deliberately overflow classes of numbers 
to achieve special effects. 

Use the delete (2) character to make undesired 
data, decorations, or background invisible. 


Whatever its purity, QFMT is certainly powerful, prac- 
tical, and neat! 
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Classes and 
Workshops 


vs 


Hartford 
Advanced APL Programming Techniques Houston 
Denver May 28 Los Angeles 
Hartford May 20 New York City 
Los Angeles May 13 Raleigh 


Washington, D.C. May 14 San Francisco 


CMCS implementation Workshop 
Boston June 8, 9, 10, 11,12 
Paris 


Computer-Aided Thinking for 
Financial Executives 


FPS Introduction 


Baltimore May 19 Los Angeles 

Chicago May 5 

Denver May 7 FPS Workshop 

Los Angeles May 14 Chicago 

San Francisco May 12 New York City 
Rochester 


Data Management with APL 


Houston June 1, 2 


Electronic Message System Workshop Houston 


Washington, D.C. June 3 


Introductory APL 


Files and Formatting Chicago 
Boston May 6 Denver 
Chicago May 21 Detroit 
Detroit May 8 Elmsford 
Elmsford May 22 Hartford 


by Roy A. Sykes, Jr. 


The Interval Functions 


Many common operations in APL require several primitive 
symbols. For example, an ascending sort is written as 
V{ AV J, and the rank of an array is expressed as ppA. 
Alan Perlis and Spencer Rugaber named these fre- 
quently occurring expressions “APL idioms” in a research 
report published at Yale University in 1977. The term 
has become widely accepted to describe short pieces of 
code that experienced programmers recognize at a 
glance. For example, ppA is not read as “the shape of 
the shape of A ”, but rather as “the rank of A ”. 
Needless to say, one man’s idiom is another man’s 
program. One measure of the experience and capability 
of a programmer is the number of idioms he is able to 
recognize. Where novices must struggle through code like 
((ViV)=19V)/V, old hands recognize it instantly as 
the idiom to remove duplicates from a vector. Thus, part of 
learning APL is encountering, verifying, and finally internal- 
izing many short pieces of code with wide applicability. 
In this article, we'll dicuss several idioms that are used 
to determine whether a number occurs in a given interval. 
In conventional notation [LO, HI] is called the “closed” 
inteval LO<X<HI, and (LO,HI) is called the “open” inter- 


The following STSC sponsored seminars and workshops will 
take place in May and June 1981. For a complete description 
of each topic see the Spring 1981 Seminar Schedule available 
from your APL* PLUS Service representative. For more infor- 
mation about times and locations, contact one of the STSC 
branch offices listed on the back of this newsletter. 


Washington, D.C. 


Forecasting Techniques 


June 17, 18, 19 


Washington, D.C. 


intermediate APL 


May 12, 14, 18, 20 
June 9,11,15,17 


May 1, June 19 


Houston June 3, 4 


June 9,10 
June 5 London May 11, 12,13 
May 8, June 18 Los Angeles June 2, 3,4 
May 24 New York City May 4, 5, 6, 7 
June 16 June 9, 11, 16 
June 24 San Francisco June 8, 10, 12 
Washington, D.C. May 26, 27, 28 


introductory APL & Files and Formatting 


May 21 
Paris May 11,12, 13, 18,19 


Program User’s Ciass 


May 26 
Dallas May 6, 7 


QUICKPLAN™ 


May 19, 20 Houston June 11,12 
May 20, 21 Los Angeles June 1 
June 18, 19 San Francisco June 23 


Statistics/Econometrics 
Washington, D.C. May 15 


June 5 


STSC Services for the APL Programmer 


May 18, 19, 20 Philadelphia May 13 
May 6, 7,8 
May 4,5, 6, 7 VM 


Paris May 4,5,6 
May 4,6,8 


Washington, D.C. 


val LO<X<HI, which does not include the endpoints. 
[LO,HI) and (LO,HI] are also defined and often called 
“half-open” (but not “half-closed” for some reason — 
perhaps “ajar” would be better). The following table 
gives APL analogs: 


Closed Open Half-open Half-open 
[LO,HI] (LO,HI) [LO,HI) (LO,HI]J 
LO<X<HI LO<X<HI LO<X<HI LO<X<HI 


(LO<SX)AXSHI (LOK<X)AX<HI (LOSAYAX<HT (LO<X)AX<HI 


It is slightly annoying that APL has no single primitive 
function to express interval relations. We must use two 
primitive relational functions and compute whether both 
are true. Ideally, we would like a single function having x 
as one argument and the bounds as the other argument. 

In fact we can construct such a function by analyzing 
the APL analogs above and using identities to derive new 
formulations. For simplicity, we will assume X to bea 
scalar. We also need to define a parameter RES that 
allows us to equate the > and < functions with 2 and 
< as follows: 

X>K ++ X2K+RES 
X<K #2. XSK-RES 
X>K <«»> X>K-RES 
XEK <> X<K+RES 


For integer comparisons, RES is 1; for example, X> 4 
is equivalent to X25 for integers. Thus RES is the 
desired absolute resolution of our comparisons. 

Given the APL analog above, let us derive a new idiom 
for the closed interval: 
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(LOsX)aA XSHI 
(XLO YAXSHI 
(X2L0)A~X>HI 
(X>LO)A~X2>HI+RES 
(X2>LO)> X>HI+RES 
>/(X2L0), X2HI+RES 
27- X2 LO; HWEFRES 
A > «<2 DOIS HITES 


The goal of the derivation is to express the relationship 
between X and the two bounds using the same rela- 
tional function ( = in this case). If X is in the given inter- 
val, one of the relations will be true and the other false. 
We then transform the scalar computations into a single 
inner product. 


((WESTING)(NOTE(S))) 


With the release of STSC’s Nested Arrays System 
(see the article entitled “Nested Arrays: Powerful 
Extension to the APL Language”, which appears else- 
where in this issue), | am initiating a new feature in the 
“Whizbang”. In each issue, we'll briefly explore some 
capability of this extraordinary system. | say “briefly” 
because the system is highly ramified; delving into any 
one subject is practically impossible without exploring 
many others. Limited space (and time) preclude such 
treatment in depth. 

STSC’s purpose in releasing the system is to en- 
courage research and practical exploration of its capa- 
bilities. Gradually many features introduced in the 
Nested Arrays System will appear in STSC’s produc- 
tion time sharing systems and distributed software. 
Others may be abandoned in the face of more 
cohesive capabilities. 

It is you, our readers and users, who will largely 
determine what features are released commercially. 
Those features will be dealt with in the main body of 
the “Whizbang”. These notes will simply try to pique 
your imagination. We solicit your comments. 

One of my first exercises using the Nested Arrays 
System was to reformulate problems I'd already solved 
with simple arrays in APL. An example is the function 
SSMAT, which follows. It produces results identical to 
those produced by the function of the same name in 
workspace 6 INPUT and the function VTOM, 
which appeared in the January/February 1980 
‘“Whizbang”’: 

VY M«SSMAT V 
C1] A FROM A SEGMENTED STRING (A SIMPLE VECTOR WITH 
[2] A IS- AS THE DELIMITER PREFIXING EACH SEGMENT), 
ee A MAKE A MATRIX WITH EACH SEGMENT LEFT JUSTIFIED 
[4] A IN A SINGLE ROV. 


[5] M+14¥°(V=2aV)cV x Me+(0l [7o M)+ M 
V 


The Nested Arrays System allows you to deal with 
variable-length data directly, thus often obviating the 
need for functions like SSMAT. Nonetheless, the 
function is instructive. Here’s how it works: 


SSMAT '/I/WORK/TO/EAT'! 
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Using similar methods we can derive seven other inner 
products that also express the closed interval: 
X<.<LO,HI+RES 
X>.>(LO-RES),HI 
Ex .=(2G-RES).8I 
X>.<(HI+RES),LO 
X=. >(HT+RESY-LO 
X> sH, O-RES 
Ye. SNI LO-RES 


In all cases the reduction function (between X and the 
period) can be replaced by = ; the relation holds true if 
either condition is true and the other is false. Further- 
more, we can negate the relation (“Is X outside the 


Disclose returns the first item of a simple array, 


57y 
/ (simple scalar) 


which equal compares to produce a Boolean partition 
vector: 


Vay 
10100004100%1000 (14-element 
simple vector) 


Partitioned enclose partitions the simple vector into a 
nested vector, 


( Yeo jer 
(ZI) (/WORK) (/TO) (/EAT) (four-item 
nested vector) 


from which we drop the first element of each. 


M+1+ (V=>V)cV 
(I) (WORK) (TO) (EAT) (four-item 
nested vector) 


We find the shape of each item, 
a 
Ce 087042 3-493 (four-item 
nested vector) 


and compute the highest (or 0 if no items): 

Off /p M 
(4) (nested scalar) 
We take each item to the same length, 


(Off /p M)t*t mM 
(f ) (WORK) (TO ) (EAT ) (four-item 


nested vector) 


and mix the items back into a simple matrix. 
M<++(Of[/p M)*+ M 


7 
WORK (four-row 

TO simple matrix) 
EAT 


Notice how the comments in the function increas- 
ingly overwhelm the code as the expressive power of 
APL grows. 


closed interval [LO,HI]?’’) by replacing reductions < and torva Range idionis Examples 
> with > and < respectively (e.g., X<.2LO,HI+RES ), | ee ae Bh 
imply by using = in all cases (LO, HI} LO<X<HI X<.<L0,H1 YEAR<.<1900 200 
or simply Dy using = ' poy half-open Is it the 20th century? 
Whew! We have 32 different inner products just to 
express the closed interval and its complement. As you 
might expect, there are also 32 inner products for the 
open interval and each of the two half-open intervals and (LO,HI) ee re O EEE see Lees. ee T 
Rather than listing all 128 combinations, we will pre- [LO,HI] LO<TX<HI-X<.<LO TEMP<.<70 80 
sent only four suggested idioms for range testing. The closed eet te 
interval LO X<HI pleasantly has X¥<.<ZLO,HJ as one 


[LO,HI) LOSXSH -ere LORI TRE <220 31 
half-open Is TN a valid tie number? 


of its APL analogs, so we will use this as the basis of our We can now recognize the <.< and <.< inner 
suggestions. (See table at right.) products as idioms which need not be decomposed to 
Notice that intervals whose lower bound is open (LO<X) understand their effect. There are many other inner nrod- 
are coded X<.<ZO,HI while those whose lower ucts that experienced programmers recognize. Do you 
bound is closed (LO<X) are coded X¥<.<ZO,HI. recognige: +. %.- AS. %a* <*> YA eR Le 
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REPLY FORM 

[] Please send to the address below __________ copies of APL in Practice at a price of $25.00 per copy. Enclosed is 
a check for . (Prepaid orders only.) 

| would like more information about APL* PLUS Service for the application 


| would like to order an STSC publication. Please send me a Publication Order Form. 

Please register me for the next three-day Introductory APL class in city 

Please add the individuals listed below to your mailing list and send them a copy of this newsletter. 
Please correct my address and/or telephone number as shown below. 

Please remove my name from your mailing list. 


ey a) dad 


Comments: 
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Mike Quataert joined STSC in 1977 as an Applications 
Consultant for STSC’s New York City Banking Group and 
became a Marketing Representative later the same year. 
Quataert was awarded STSC’s Outstanding Salesperson 
Award and Outstanding New Business Salesperson 
Award in 1980. 


Michael W. Quataert 


Before joining STSC, Quataert was employed with 
Citicorp in New York City and the National Technical 
institute for the Deaf in Rochester, New York. Quataert 
holds a B.S. degree in Computer Science from the State 
University of New York at Stonybrook and is a member of 
the Association for Computing Machinery (ACM). 


Just Off the Press 


-nA A Á aid 


APL 4 PLUS Data Management System User’s Guide 


The APL * PLUS Data Management System User’s Guide 
describes STSC’s new, conversational system for build- 
ing, modifying, manipulating, and displaying rectangular 
tables of data. (See “Conversational Data Management 
System Simplifies Table Editing” in the March-April 1981 
issue of this newsletter.) Once defined, a data manage- 
ment system can be used to maintain data for inquiry or 
display. The 33-page guide presents an example illus- 
trating the use of the system, and provides step-by-step 
instructions on each of its features. $5.00 


Collected Whizbangs, Volume 2 


Collected Whizbangs, Volume 2 is acompilation of “Whiz- 
bangs” taken from past issues of this newsletter. Volume 
2 contains a dozen ‘‘Whizbangs” covering the period 
from July 1978 through October 1980. The collection 
is reprinted by STSC as a handy compendium of APL 
techniques and tutorials for reference, careful study, or 
leisurely perusal. $6.00 
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Enhancements, Version 3.0.1, Supplement 2 


Supplement No. 2 to Enhancements in the APL x PLUS 
System, Version 3.0.1, (STSC, 1979) describes recent 
enhancements made to the APL* PLUS VM Time Shar- 
ing System. Among the enhancements described in the 
44-page supplement are the two system commands 
)RESET and )EDIT, the system variables UTCDEL, 
OTCNUL, and DAT ,and the systems functions [JA RBIN 
and JJARBOUT. 7 (N/C) 


HPPLOT User’s Guide 


The HPPLOT User’s Guide describes a group of APL 
programs that produce four-color business charts and 
graphs on the popular Hewlett Packard 7221 Pen Plot- 
ter. With the functions in workspace 10 HPPLOT, you 
can produce point plots, line plots, bar charts, and pie 
charts to give you a quick and easily understood ‘“‘pic- 
ture” of your data. The 43-page publication contains 
several illustrations of the four-color displays you can ob- 
tain. A section is also devoted to auxiliary functions that 
allow the experienced programmer to create custom APL 
graphic applications. $6.00 


SOLUTION-33 User’s Guide 


The SOLUTION-33 User’s Guide describes STSC’s pro- 
prietary application, SOLUTION-33™, designed to meet 
the inflation accounting requirements set forth in State- 
ment Number 33 of the Financial Accounting Standards 
Board. The guide includes many sample terminal ses- 
sions showing you how to customize the application for 
your company’s particular needs; examples of work- 
sheets, restatement calculations, and standard reports 
and graphs; and appendixes on report definition and plot- 
ting. The 137-page manual comes in its own looseleaf 
binder and contains an index and tab dividers for easy 
reference. The guide is STSC licensed material, and is 
available only to customers who execute a license agree- 
ment. $15.00 


Punctuation 


APL has always had a notable lack of punctuation. Not 
that traditional punctuation marks aren’t widely used — 
it’s just that they seldom punctuate. Here is a short list of 
well-known punctuation marks and the rather creative 
uses to which the implementors of APL put them: 


Name of APL Uses (operator, function, 

Punctuation Mark Symbol punctuation) 

comma » comma ravel, catenate, laminate 

period . dot product (inner and outer), 
decimal point 

hyphen - bar negate, subtract 

virgule / slash reduce, compress, replicate 

question mark ? query roll, deal 


exclamation point ! shriek factorial, combinatorial 


The terseness of APL is evident in the names of its func- 
tions. Comma is especially notable in this regard — it per- 
forms an operation that languages like PL/I superfluously 
call concatenation, and its monadic usage is synonymous 
with unravel. The name for “ ! ” is also contrastingly 
brief, and strangely yet appropriately provocative. If APL 
had a fire function, we doubt that it would be inflammable! 

APL does employ some punctuation. Defined func- 
tions use dels (V) to delimit their definition, and colons 
( :) and semicolons ( ;) to set labels and local names 
apart. Diamonds (©) separate statements on a single line, 
and lamps ( aà ) introduce comments. Quotes (') are 
used to enclose character constants. Nevertheless, APL 
statements remain almost unpunctuated. 

We say “almost” because some other vestiges of 
punctuation remain. For example, an expression like 
MATRIXlLROWS;COLS] uses the semicolon not as a 
function or operator, but as a list delimiter. Semicolons 
are also used in the obsolete “mixed output” facility and 
in the list right argument to UFMT. The right argument of 
LJFM7 is also unique in that it must be enclosed in paren- 
theses. 

Parentheses are in fact the most widely used punc- 
tuation marks in APL. They don’t serve quite the same 
purpose as in natural languages (like English), and strictly 
speaking they are unnecessary, for their “only substan- 
tive use . . . is to form the left argument of a function” 
(APL Language, IBM, 1978). For example, expressions 
like ((A*2)+B*x2)*0.5 and ((1+0M).,10)+M can 
always be reformulated as several parenthesis-free state- 
ments: 


TEMP<+Ax2 O TEMP+TEMP+B*2 © TEMP«*0.5 
ROWS<1+0M © COLS+ROWS,10 © COLS+M 


An exception to the rule is the list right argument to 
OFMT (but see “Nesting Notes” below), which requires 
Surrounding parentheses, and which consequently has 
the distinction of forming the only APL statements neces- 
sarily ending with a right parenthesis. Of course, no valid 
APL statement can begin with a right parenthesis, thus 
providing us with a convenient escape character for sys- 
tem commands. 

In practice, parentheses are used for several reasons. 
The parenthesized expression of the Pythagorean 
Theorem given above is certainly clearer than the multi- 
statement version. Likewise, most of APL’s selection and 
structural functions (such as compress / , reshape 0 , 
and transpose &), have the control information in their 
left argument. It is often clearer to include the computa- 


tions which generate selection criteria in the same state- 
ment that is performing the selection, as illustrated in the 
matrix example above, and in the one below: 


(P>100)4M 
rather than 
B«+P>100 © BAM 


Occasionally one uses superfluous parentheses to 
Clarify a statement, particularly one involving a relational 
test. This usage conforms to the natural tendency of 
English speakers to put subject before predicate: “He is 
six feet tall.” rather than “Six feet is his height.” and “The 
price is more than $100.” rather than “$100 is less than 
the price.” In APL, 


H=6 rather than 6=H 
P>100 rather than 100<P 


While the subject of an English sentence may be 
modified without introducing further punctuation (“The 
price of new cars is over $10000.”), the equivalent left 
argument to an APL relational function must be paren- 
thesized. Thus one often sees 


(NEW/CARS )>10000 
rather than the shorter, but less directly readable 
10000<NEW/CARS 


A related use of parentheses occurs when construct- 
ing expressions to eliminate superfluous operations. One 
should avoid introducing unnecessary operations simply 
to eliminate a set of parentheses. It is far preferable to 
write 


(X+20)+100 OF 0.2+X+100 


rather than 
$100#X+20 


It is likewise more natural (and efficient) to perform 
(-N)+V rather than dN+OV. 

Parentheses can have a dramatic effect on execution 
speed. Just as “Hal while Had had had had had had had 
had Had had had had a better effect on the teacher” 
takes longer to read than its punctuated version, so too 
does + /V:pV take longer to execute than(+/V)zpoV. 
There are many examples of grouping to reduce compu- 
tation. While 


NEW/CARS>10000 

will execute correctly, 
(NEW/CARS)>10000 

will usually execute faster. Similarly 
FL3]+GL4J+H[6]pr 

can be modified to 
(FL3J+GL4))+AL6 Jer 


typically almost halving the number of additions required. 
Some people adopt a variety of other stylistic guide- 
lines for parentheses. One is simply stated: If one of two 
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expressions is to be parenthesized, parenthesize the 
shorter one. In the above example, 


(FC3J+GL4J))+HL6J) pr 
would become 
(HL6JoT)+FL3I+GL4) 


In the absence of other more important criteria, this 
guideline makes sense because it reduces the number of 
characters the eye must scan to encompass a sub- 
expression. A similar guideline is to avoid more than three 
or four levels of nested parentheses in a statement. 

Somewhat more controversial is the admonition never 
to enclose assignments within parentheses. The under- 
lying rationale is that such embedded assignments repre- 
sent statement side effects which should be avoided. 
This stricture is often further strengthened to completely 
eliminate from statements multiple assignments which do 
not have the same right argument. Thus 


((NWESTING) (NOTE(S))) 


Now that we’ve reviewed current APL punctuation, 
let’s look at the future. STSC’s Nested Arrays System 
uses all of APL’s traditional punctuation rules, but 
adds a few new twists. Moreover, although semicolon 
is still supported for the list arguments to subscription 
and DFMT, we'll see that it has been obsoleted by 
more consistent and advanced facilities. 

Two of the new features in the Nested Arrays Sys- 
tem are strand notation and generalized operators. 
Both facilities use parentheses in new contexts. 

Strand notation generalizes APL’s current function- 
free representation of vector constants, and can 
further represent nested vectors. As such, it elimi- 
nates the notational need for catenation (. ) and en- 
close (c). As today’s constant numeric vector, 


> Soe. M33 Fa 
5 33.243 


is shorthand for 
(92S, 48) 37793 


(as was originally required in early implementations of 
APL) so 


ba Se Bd ee a Re 
(6: 9 49) (35 63) ties 169 9f- 233 


is shorthand for 

Peo 5 ynt 8) ele F333 7 9 
Parentheses are used to group items of the three- 
element nested vector. 

The concept of an operator has been generalized 
and solidified in the Nested Arrays System. Whereas a 
function (like +, OCR, or +.x ) takes data as argu- 
ments, returning data as the result, an operator (like 
reduction / or product . ) takes data or functions as 
arguments, returning a function as the result. Thus, in 


CEJ F 2 36" 2280 
27 
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A+B+2x131 

is acceptable, but 
A+B+(C+2)*D<131 

should be reformulated as 
C+2 Q D+131 Ò A+B<CxD 


The function BRKOUT in workspace 11 TOOLS onthe 
APL* PLUS System is available to “cleanup” such em- 
bedded assignments. 

To summarize, APL currently uses only quotes, semi- 
colons, and parentheses to punctuate statements. Semi- 
colons are used to separate items in a list, and are widely 
regarded as anachronistic. Parentheses are used to 
group expressions forming the left argument to func- 
tions. While formally optional, parentheses are widely 
used to reduce the number of statements, clarify expres- 
sions, and improve efficiency. 


the reduction operator accepts the addition function 
as its left argument, producing the summation func- 
tion, which is then applied to the data. The paren- 
theses are not permissible in conventional APL; they 
are acceptable (though optional) in the Nested Arrays 
System because functions are recognized as group- 
able entities. 

There are many extraordinary consequences of 
this formalization. We see that monadic operators 
have their argument on the left. Furthermore, opera- 
tors have long left scope and short right scope — just 
the mirror-image of functions. Hence, the right argu- 
ment to an operator is the single object (primitive func- 
tion, name, or constant) to its right. Furthermore, 
since the right argument to an operator can be data, 


(+\*3) 5 2 3 4 equivalent to +\+\+\ 5 2 3 4 
5 17 39 75 


we need to distinguish the operator’s right argument 
(3) from the right argument to the derived function it 
produces(5 2 3 4). Parentheses are used to iso- 
late the derived function. 

To paraphrase IBM’s APL Language manual again: 
A consequence of [these] rule(s] is that the only sub- 
stantive use[s] of parentheses [are] to form the left 
argument of a function [, to form the right argument of 
an operator, to nest items in strand notation, or to iso- 
late derived functions]. 

Although parentheses become more important, the 
need for semicolons is eliminated (except in function 
headers, where they are unobjectionable). (FMT 
now accepts a nested vector as its right argument, 
replacing the former list. 


‘7376.7, Tut DPR? €2 3-4 Sey ise Fs 1) 


—_— 


-t a 


A strand notation assignment: 
ABC + (13) (3 5 p'CROW DUCK GOOSE') (2 89 65) 
A 


CROW 
DUCK 
GOOSE 


C 
2.89 6§ 


pŪU+A BC 
(i 2.3) CRO? (2 89 65) 
DUCK 
GOOSE) 


"DF4.0,5A1,GM IS Z2Z¢/LB.M' OFMT ABC 
CHOW JS Sgp. 
DUCK IS 89¢/LB. 
GOOSE IS 65¢/LB. 


Subscription now accepts a nested array, each item 
of which is a vector whose length is the rank of the 
indexed array. 


BIES 29-44-32) 62° 3) €2 2) Oo wi -¢5.-8)3 
GROUSE 
BEL? 1425 464) 
DC 
EGO 
sit .2°9-o, 5 3 
SF 2 TF g 
3 67 See 15 3) 
ee 
e +... 3 4-33 
DC 


7 
J 


Perhaps someone can come up with new uses for the 
forlorn and deserted semicolon! 


Cherlin holds a doctorate in Mathematics from Rutgers 
University. He is a Fellow in the Society of Actuaries, a 
member of the American Academy of Actuaries, an 
Associate of the Casualty Actuarial Society, an Enrolled 
Actuary for Pensions, a Fellow of the Life Management 
institute, and a Chartered Life Underwriter. 
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by Roy A. Sykes, Jr. 


Reduction 


A master stroke in the design of APL was the generalized 
concept of reduction. Conventional mathematics uses idio- 
syncratic symbols to denote certain common operations 
such as sum (Sigma £ ) and product (pi T). Many pro- 
gramming languages use intrinsic functions like the built- 
in SUM and PROD functions of PL/1. APL, however, 
uses a single keystroke (/) to indicate that a function 
is to be applied between all elements of a vector. The 
operation to be performed is identified by the symbol to 
the left of the slash. 

The advantages of the APL approach are clear. Pri- 
marily, it is conceptually consistent and clear; it unifies 
such divergent semantics as the MAX of FORTRAN (f /), 
the ANY of PL/1 ( v / ), and the sigma of mathematics (+ /). 
By allowing any of APL’s 21 scalar dyadic functions to be 
used as the left argument, useful capabilities not even 
contemplated in other languages are made available in 
APL, such as alternating sum (-/), hyperexponential 
(x /), parity (z/), and last only ( </ ). Reduction forms a 
basis for other powerful operators in APL such as scan 
and inner product. 

Rather than applying uniformly to all elements of an 
array, reduction applies only to a single coordinate, allow- 
ing programmers to structurally discriminate the data 
they want reduced. By default, the coordinate is the last: 


532 Five rows and three columns. 


E/F Sum columns (same as 
729 441 441 441 1089 +/[ppP]P or 
+/[2 JP inorigin 1 ). 


By using the axis operator [K], or # to indicate the 
first coordinate, any other coordinate can be specified: 
EFE 
529-625-4197 Sum rows (Same aS +/[1]P). 
“Reduction” is an apt name for this capability. It con- 
solidates coordinates, decreasing the rank of the argu- 


ment by one (e.g., reducing the matrix P above pro- 
duced a vector). Reduction typically diminishes the num- 
ber of elements (but can you think of a case where it 
increases the number of elements?). And it concentrates 
data much like reducing a sauce concentrates flavor. 

The most common reductions currently used are listed 
below. Those that accept only Boolean arguments re- 
turn 1 fortrue and oO for false; in some cases they can 
be used with other numeric arguments, although with lit- 
tle utility. 


Boolean Numeric 
Arguments Arguments 

+/ How many? Sum 

x/ (see a ) Product 

LZ (see a ) Lowest 

[Z (see v) Highest 

=$ Alternating sum (sum of odd ele- 
ments less sum of even 
elements) 

+/ Alternating product (product of 
odd elements divided by 
product of even elements) 

A/ All? 


v/ Any? (same as ~a/~ ) 

</ Only the last? (same as ~</~) 

z/ Odd parity? (same as ~=/~) 

>/ Odd leading parity? (same as ~>/~ ) 


Nand (x), nor (~), power ( *), logarithm (@), circular 
(©), binomial ( ! ), and residue ( | ) are seldom used in re- 
duction except for trivial cases (for example, 0/1 2 3 
is SIN COS 3 radians). Occasionally one finds unusual 
uses, such as | /¥V , which Larry Breed (IBM) originally 
put forth to determine (in origin 1 ) whether the first ele- 
ment of a numeric vector V is distinctly the lowest ele- 
ment of the vector (A/V[1]<1+V). 

Particularly pleasing are instances where several dif- 
ferent reductions are juxtaposed. For example 

e Do any rows of a character matrix contain only 

stars and blanks? 


ViAJ tT Mes. 


e Calculate the average ratio between the two 
columns of matrix M : 


(+/+/M)+1+0M 


e How many times does the most-frequent vowel 
occur in a text vector? 


| +7 AELOG te SeTEXT 


ə Compute an integer from its prime factoriza- 
tion matrix: 


x/*x/PFM 


e ina matrix of integer keys arranged in ascending 
order, compute the position of a new key: 


14++/</(KEYS-(opKEYS)ONKEY) ,0° 
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Reduction (and its derivative, inner product) serves 
one unique purpose in APL, that of computing identity 
elements. The identity element J for a function F is that 
value which satisfies one of the following equivalences 
for any value A : 


A ++ IEA Left identity 
or 


Ye oe oe ee Right identity 


For example, 1 is the right (but not the left) identity ele- 
ment for division because A+>A+1. Similarly, O is the 
left (but not the right) identity element for modulus be- 
cause A++0|A . Addition (+), or ( v ), and not equal (= ) 
all have O as both their left and right identities; multiplica- 
tion (x), and (^), and equal ( =) all have 1 as both iden- 
tities. Some functions (@,0, x, and » ) have no identity 
elements. 

Reducing an empty vector yields the identity element 
for that function, as does reducing any empty coordi- 
nate: 


“f%9 Right identity only 


+7 taj) 300 
ił 


(These are the cases mentioned above where reduction 
increases the number of elements.) 

Why are identity elements important? Because during 
processing we occasionally encounter empty arrays that 
we wish to reduce and yield consistent answers. For 
example, if ACCTS is a vector of account balances for 
all customers, we wish meaningful answers to questions 
like, “What is the total of all accounts?” 


Right identity only 


+/ACCTS 

0 

“Are any accounts over $1000?” 
V/ACCTS>1000 

0 


even when we've just started the business, “How many 
accounts?”’. 


pACCTS 


To carry the example further, if we consider merging 
two businesses, we should be able to compute pro forma 
total balance by 


(+/ACCTS )++/OTHERACCTS 
1560.49 


Apparently, the other business has $1,580.49 in re- 
ceivables. After the merger, 


ALLACCTS+ACCTS ,OTHERACCTS 
our balance should remain the same 


+/ALLACCTS 
1580.49 


which is possible only if +/10 returns 0. 
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The 21 reductions possible in conventional APL have 
been well explored. STSC’s Nested Arrays System, 
however, allows reductions of not only the 21 scalar 
dyadic functions, but of all primitive dyadic functions. 
Furthermore, the left argument to reduction can also be 
any dyadic derived function (Such as outer product), or 
even a user-defined function with explicit result (such as 
VR+A FOO BV). Needless to say, we now have a lot 
of new capabilities to explore! 

The reason conventional APL does not allow mixed 
functions (Such as + and ; ) to be used in reduction is 
that doing so would violate a fundamental property of 
reduction: that it reduces rank. For example, reducing a 
vector in the Nested Arrays System, 


4/7  § 99 
(ore oo ST ee 


still returns a scalar — this one nested, having seven 
elements. In conventional APL, such nesting is not possi- 
ble; the only plausible result would be a seven-element 
vector, which would be inconsistent with the expected 
scalar result of reducing a vector. The Nested Arrays 
System circumvents this limitation by extending the defi- 
nition of reduction (on vectors) from 


VL] oVL2] eVL3] ©...0eV[pV] 


Length, (-position), value 


@/V <> 
to 
O/V +> ¢(1>V)0(2>V)0(32V)0...0(pV)>2V 


by using pick (>) to select the items rather than sub- 
scription. The result therefore remains the same for 
scalar-function reductions of simple arrays. 

On simple vectors, most mixed primitive function re- 
ductions are uninteresting. For example, € / is precisely 
identical to =/ . The following table gives informal mean- 
ing to reductions of a simple numeric vector of length two 
or more by existing mixed dyadic functions: 


Reduction Equivalence or Error Message 


e/V Ey 
A/V +/V except for 0+0 
T/V | ZV 


Q/V LENGTH ERROR 

\/V LENGTH ERROR (typically) 

1/V RANK ERROR 

?/V RANK ERROR (typically) 

¥/V DOMAIN ERROR (unless of length two) 

o/V > 1+V last element as a scalar (> 1V) 

1/V > 147V last element as a scalar (017) 

Tai cV enclose 

+/V c(-0A.= 1+V)+V last element if all 
others 0 

4/V cV[1]+4 1+V only if all( 14V) are the 
same sign; more complicated otherwise 

o/V cVL1]p(-0A.%# 14V)+V first element 
reshapes the last or fill element 

//V c(x/ 1+V)p 1+V replication reduction 


The final entry is particularly interesting because the 
slash is being used in two senses: as the replication 
function (extended compression) on the left, and as the 
reduction operator on the right. 


EE EST 
ie 3.3 3 °323.-2 2) 

c 2/1/4/3 
LS 23-2 woo oe 


Much more interesting are mixed-function reductions 
of matrices and nested arrays, and reductions using de- 
fined functions. Perhaps the most useful is catenation 
reduction (, /) which discloses and catenates the items 
of each vector along the specified dimension, encloses 
those items, and assembles a properly shaped result. 


SS Meg EF E Reshape each, 
Cy £2 FFT FS Ses (three-item nested vector) 


Ff 2 3 c` £ 7 8 catenate the items together, 
fee EF TRF 8.8. 2) (nested scalar) 
ae ee ee tae € 7 2 and disclose as a simple vector, 


Zz 

7 a 7-35 &-E (nine-element simple vector) 

a a oo a 7 or use inner product, 
oe a a ee EE 

>,/c/ 2% 3 ,11.5] E 7 2 reshape reduction of a two- 
7 7 Te E column matrix, 

2 4 eat or simple replication. 
' 3.3 Q 


The idiom =>,/ is used widely in the Nested Arrays 
System. Its full form is >,{[7]J/CXJA. For first-coordi- 
nate operation we can use + to reduce rows of a matrix, 
and > to catenate as rows. in the next issue we will fur- 
ther explore this idiom, as well as other reductions. In the 
meantime, we'd certainly be interested in any new uses 
for reduction you have found in the Nested Arrays 
System. 
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e The diamond statement separator will be an op- 
tional feature. 

e Conforming implementations must provide 
specification sheets that document system 
limitations (e.g., the maximum rank, the max- 
imum identifier length, and the length of DAT). 

In addition to specific features of the language, the 
standard will address file system design. Consistency 
and compatibility among APL file systems will aid in 
the transportability of large applications that share 
data among users and catalog large amounts of data. 

STSC fully supports the effort to develop an APL 
standard. Two of STSC’s top APL experts are cur- 
rently participating in the work of the X3J10 Commit- 
tee: Clark Wiedmann and Bob Smith, Manager of 
Special Research Projects. 

The public is welcome to participate in the X3J10 
Committee’s activities, though only one person from a 
single organization may vote. For more information 
about the committee’s activities, contact Clark Wied- 
mann, STSC, Inc., 131 King Street, Northampton, 
Massachusetts 01060.m 


ei 
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by Roy A. Sykes, Jr. 


Replication 


The history of APL is filled with instances where a 
rather limited capability was seen as but a special 
case of a more general facility. For example, conven- 
tional dot product, matrix multiplication, was envis- 
aged as just one of a whole family of useful matrix 
transformations. Further, the concept could be ex- 
tended consistently to arrays of rank other than two, 
such as vectors and three-dimensional arrays. Thus 
was born today’s inner product; anyone who knows 
the rules for matrix product (+. x) can easily infer the 
rules for table lookup (^ . =). 

A recent example of this ongoing process of 
generalization is the recent extension to compression 
( / available on STSC’s APL* PLUS VM Service. Until 
several months ago, compression was only a selec- 
tion function; it would either select or ignore portions 
of its right argument based on its Boolean vector left 
argument: 


3 ee oe Bs ee Se ee ee ee ae ee 
o-3=5 
D<+M+ 10 20 30 0.415 
ps ee Se SS Ge 
eS go peers ees ae 
co ex faa a 


O11 0 1 /M Equivalent to 


12 43-435 FERA a Se 
27-23 35 (select columns). 
32-33-35 

O41 -FN Equivalent too 1 1 /[1]M 


EE ot See Sas Ee eS 
dege 3a 34-35 


(select rows). 


The length of the left argument had to match the 
length of the appropriate coordinate of the right argu- 
ment. That is, each subarray along the specified 
dimension had to be explicitly flagged for selection 
(1) or not (0). A scalar or one-element vector was 
also acceptable, and was extended to the proper 
length. 

Several years ago it was recognized that compres- 
sion could be reformulated as an element-by-element 
reshape and catenation. Thus, the first example 
above was viewed as 


(Op t)s(1p2),(ios), (0p4), (ips) 096 
Saas eae 


Why not, it was reasoned, allow any non-negative in- 
teger to be used in the left argument? Why not, in- 
deed; today it is available: 


Opti (462 pS Opu]; 2053, 856 
(Seer Lee ree aoe, Bet tee 
P=} fe > 3 B34 - SB 


2 /M 

Tte d2- be aoe SO Be 
aS Be Ses a eee ae Fe ee 
3232 32 3 ¢ 3 O 


This useful and consistent extension to compression 
is called replication because it can replicate elements 
or subarrays. 

As with any new feature, we are only gradually 
discovering applications of replication. The most com- 
mon usage seems to be in replicating data for scalar 
dyadic functions. For example, SALES is a5-region 
by 20-product by 12-month array. Each product has 
a commission factor (20=pCF). Compute commis- 
sions on sales: 


OLD TECHNIQUE: SALES: 1-3-2 BS 12 20-067 
NEW TECHNIQUE: SALESx5412/ 1 20 1 pCF 


Some common idioms are 


,R(SCAL,pVEC)pVEC  ,&(SCAL,N)piN 
which simplify directly to 


SCAL/VEC SCAL/.N 
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Techniques which subscript an array by the sum scan 
of a computed Boolean vector (ARRAY(+\BV J) also 
often call for replication. Thus 


VY R«REP REPLICATE DATA 
Lt R«+REP+#0 x DATA<R/DATA x REP<R/REP 
[2] R+(+/REP)+OlIO » RCOLO++\ 1+REPI]<1 
ze R+DATAL+\F2] 

V 


simplifies directly to R«REP/DATA_ with concomi- 
tant savings in workspace storage and computer 
resources. 

The following function illustrates another use of 
replication, consecutive underscoring. 


TRUR As 38 
C1] a UNDERLINE NON-BLANK CHARS IN A VECTOR. 
[23 B+QTCBS ,OTCLF ,UTCWL ,OTCNUL ,OTCDEL ,UTCBEL 
[3] B+1,AeB,' _ABCDEFGHIJKLMNOPQRSTUVWXYZA' 


C4] C+B<R+1O0B ® R+pB+B>R 
CST Cesc y(c7 ik )- Bs eh 
[6] Petts E i tyes he) p72 ee cee 

V 
Line [3] flags all characters not to be underscored, 
which includes control characters, blanks, and 
characters already underscored. Line [4] marks the 
beginnings ( B ) and ends ( C , offset by one) of all 
passages to be underscored. Line [5 ] computes 
the length of each passage, and positions that value 
at the end of the passage. Finally, line [6 ] com- 
putes a replication vector which selects all elements 
of A , and intersperses the proper number of con- 
tiguous backspaces and underscores. 


UL: ' WHO? TS -ONE FLRS? 
WHO'S ON FIRST? 


——_—— a =< — => 


(Note: The capabilities of replication are also available 
on STSC’s MVT-based APL * PLUS Service as the 
function REPL in workspace 6 UTIL.) 


((WESTING) (NOTE(S))) 


The Nested Arrays System has extended replication 
even further to allow negative elements in the left 
argument. Such elements insert the indicated number 
of fill elements into the result: 


Gk 4-8-9. 6 r a 28-3. A 3-8 
> os 9-5 8 §-6-0-90 

t 41-41 6 6:2 7° *ASTRIVE" 
A TREE 


If the left argument is composed with the replica- 
tion function, an ambivalent function is created. 


1-4 t -#26 0 - Sef 4ASTRI DE 
A TREE 


Used monadically, that function applies a fixed 
replication to whatever right argument is presented. 
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(o> Sha: = 6 Os 2a 8 SS? 
i -0-3 & 74 


But when used dyadically, 


eS oe oe eee eS ee eee ee ce ee ee, 
I Sa Se “ey 


the function is called mask, with negative values selec- 
ting from the left argument (rather than the fill element) 
and positive values selecting from the right argument. 


PRIESTESS? <4 40" 2-4. 41-18 75> CAs reser" 


PRIVATEER 


The previous ‘“‘Whizbang” discussed reduction and 
its generalization in the Nested Arrays System. 
Specifically, disclose of the catenation reduction 
(>, /) was introduced. Let’s explore this idiom further 
by looking at its effect on a matrix of matrices: 


MM+('FIRST')('SECOND')('THIRD')('FOURTH' )('FIFTH')('SIXTH' ) 
MM<«+O+ 2 3 p 1 7°9 74 MM 
(FIRST Y} {SECOND J). (THIRD ) 


MM is a 2 by 3 matrix, 
each item of which is 
a 1 by 7 matrix. 


(FOURTH J- (FIFTH ) (SIXTH -) 


Because MM is a matrix, we can catenate-reduce it 
either across the last dimension (columns) by using 
,/MM as shorthand for ,/L2J]MM , or down the first 
dimension (rows) by using , 74M as shorthand for 
»>/L1IMM . 

Reduction across the columns produces a two-item 
vector of 1 by 21 matrices: 


» / MM 
(FIRST SECOND THIRD ) (FOURTH FIFTH SIXTH ) 


Reduction down the rows produces a three-item vec- 
tor of 1 by 14 matrices: 


CPIRST: FOURTH )} (SECOND: PIFTH.—) ATHIRD SIXTH ==) 


In both examples above we performed column 
catenation (placing the matrices side-by-side) by us- 
ing , as shorthand for , L2 ]. We can also perform 
row catenation by using ; as shorthand for ,[ 1]. 


Row catenation across the columns produces a 
two-item vector of 3 by 7 matrices: 


> /MM 
(FIRST (FOURTH 
SECOND FIFTH 
THIRD ) SiATH ) 


Row catenation down the rows produces a three-item 
vector of 2 by 7 matrices: 


> *MM 
(FIRST (SECOND (THIRD 
FOURTH -) FIFTH ) GLASK ) 


Notice that only , /MM and 5 #MM preserve the row-major 
ordering of the original matrix. 


Subsequent reductions on any of the nested vec- 


>5/37MM A >5/,8MM 


tors shown above will produce a nested scalar. If we FIRST 
disclose that scalar, we reveal its item as a simple FOURTH 
matrix. The six interesting combinations are shown SECOND A 6 by 7 matrix. 
below: FIFTH 
THe 
>,/,/MM a >,/,MM SIATH 
FIRST SECOND THIRD FOURTH FIFTH SIXTH A 1 by 42 : 
matrix >3/,/MM A 2,/37MM i 
i FIRST = EECCA? THERD A 2 by 21 matrix. 


FOURTH REETH SlALH 
a TIMM A »>5s/,7MM A >>/,/28MM 


>,/ fhe. a drk SMM 
FIRST FOURTH SECOND FIFTH THIRD SIXTH ÅA by 42 


55/5/MM a >5/,MM matrix. FIRST FOURTH 
FIRST SECOND FIFTH A 3 by 14 matrix. 
SECOND A 6 by 7 matrix (the last THIRD SIXTH 
THIRD column is all blanks). . 
FOURTH Since the right-most reduction returns a vector, it 
FIFTH is immaterial whether the left-most reduction is along 
SIXTH the first coordinate (,7) or the last (,/).™ 
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by Roy A. Sykes, Jr. 


No Sort Grade 


In several previous Whizbangs (April, 1978 and 
January, March, and September, 1979) we dis- 
cussed the APL grade primitive functions ( 4 and ¥ ), 
mostly regarding their use to sort data—no doubt 
their commonest application. However, recent 
enhancements to STSC’s APL* PLUS VM Service 
allow grade to apply directly not only to numeric vec- 
tors, but also to numeric and character arrays of 
higher rank. Sorting a character matrix CM according 
to a collating sequence CS is now trivially simple: 
CMCLCSACM;]. So this month we'll present several 
other uses of grade, but no sorts. 

The result of grade up applied to a numeric vector 
is not obvious to most people on their first encounter. 
The numbers represent a permutation of the indices 
in the argument which would reorder (sort) the argu- 
ment in ascending order: 


A 6.4 ZTF S282 


j= 
on 
(o>) 
lf 


ae ie Sica oo apie of the 7 
(highest number) 
indices of the 6’s 
(third lowest number) 
index of the 3 (next 


lowest number) 

indices of the 2’s 

(lowest number) 
Notice that the indices for each set of distinct 
numbers are strictly increasing; that is, grade up 
preserves the order of identical values in its argu- 


ment. This important property, called stability, also ap- 


plies to grade down. 


ae of the 2’s 
(lowest number) 
index of the 3 (third 


highest number) 

indices of the 6’s 

(next highest 

number) 

index of the 7 
(highest number) 

We can use this property in several ways. For ex- 

ample, YA returns a permutation that would 

reorder A in ascending order, but with the indices of 

equal values in strictly decreasing order. 


Jw 
JNO 
H 
gE 


| L— index of the 7 
indices of the 6’s 
index of the 3 


indices of the 2’s 
Given a vector V partitioned into logical segments, 
and a Boolean vector P flagging the start of each 
partition, we can easily reverse the values within each 
partition. (We’ve underlined values associated with 
each partition for clarity.) 


Pe 4-0 yey Too 

ts aoe h 

+\P Assign equal values to each 
1—1 2 3.3.3 4 partition in ascending order. 

Y+\P Compute the descending in- 
£ 3.8.8°R Joe dices of each partition in 

descending order. 

oY+\P Compute the ascending in- 

sah S-bs3 4-7 dices of each partition in 


ascending order. 


VL OY+\P J 
5 4S e932 


Rearrange the vector accord- 
ingly. 


This is the algorithm used in PREVERSE in 
workspace 6 PARTFNS onthe APL*PLUS 
System. 

Ranking values is another useful application of 
grade. The usual ranks AAA (ascending rank) and 
AYA (descending rank) indicate where correspon- 
ding elements of A rank relative to one another and 
to their position in A . Incidentally, this is the result 
that most beginners predict from grade alone. 
ae 
+ 3° 4 
The disadvantage of this type of ranking is that equal 
values (for instance the three 6’s) are not assigned 
equal ranks (4 5 6). A simple remedy is to sort the 
vector, preserving the grade permutation, 


O+S+ACO+P+4A J 
ie oe ae Be ee 
2°38 ES 8.2 


mark the beginning of each partition, 


O+B< 141,8215 
ict 4S Gees 


assign rank values to each partition. 


(]+R<+\B 
4 | 2 4 Bes. 


and finally assign those dense ranks to the positions 
P from which they originated. 
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V R«DRANK A;P 

4] a DENSE RANK OF NUMERIC VECTOR. 

]  A+*ALR+P+hA] 
a3 RCPJ+\ 14+1,A4416A 

V 

Dense ranking (as performed by NDRANK in 
workspace 6 SORT ) is useful because it computes a 
small and unique scalar key for each different value in 
the original array. Furthermore, that key perfectly ex- 
presses the relative ordering (although not position) of 
the original values. These keys can be used conven- 
iently in subsequent computations. 

In the next “Whizbang” we'll discuss two more 
nonsorting applications of grade. The first is finding 
unique rows in a matrix, as well as finding all 
anagrams in a list of words. The second was originally 
presented in Bob Smith’s “Problem Section” in the 
June 1972 issue of APL Quote-Quad (Vol. 3, No. 5). 
We'll restate the problem so you can work on it in the 
meantime. 

Progressive Dyadic lota Problem (contributed by 
Jeff Lewis, The Rouse Co.): Find a line of code 
that, given a vector (say L ) and either a scalar 
or vector (say R ) of the same type as JZ, will 
find the occurrences of the elements of R in 

L (just like LıR ) except that duplicate 
elements in R are handled differently. 


For each set of identical elements in #, the first 
one is assigned the subscript of its first occur- 
rence in Z , the second one is assigned the 
subscript of its second occurrence in LZ, etc. 
Also, like dyadic iota, if an element in Z does 
not occur in R (or does not occur enough 
times in Z ) then it is assigned the value of one 
plus the highest allowable subscript for L. 


An example (in origin 1): 
L*"ABACDBC" 
R+'DDBACBAXB'! 


resik =i 8 2:1 &U- 63828 


(Twenty-seven characters, in either origin, using 
no named temporary storage.) 


((NESTING)(NOTE(S))) 


Strand notation is one of the most revolutionary 
features of STSC’s Nested Arrays System. It is also 
somewhat controversial and misunderstood. The July 
1981 ‘‘Whizbang” introduced strand notation; let’s 
explore it in more depth. 

One annoyance of APL is the paucity of arguments 
available to a function: one or two. We do know that 
some functions require more than two arguments. 
Many primitive functions have three arguments 
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(Ad[K1]B), and some have four, counting implicit 
(global) arguments such as OCT, DIO, and URL 

( A1B and A?B ). The anomalous syntax of 
subscription and OFMT apparently reflects an ir- 
resistible early need for polyadic functions. With tradi- 
tional APL, we’ve adopted a number of tricks to cir- 
cumvent this problem, among them global variables, 
data encoding and packing, and executable character 
strings. Furthermore, functions often compute more 
than one useful result, yet we only have a single ex- 
plicit result at our disposal. 

Strand notation allows us to easily join and 
separate multiple values. We are no longer limited to 
infix syntax, but can choose prefix syntax if we wish. 
For example, 


V R«{FS} AFMT VALUES 
C1] +(ONC 'FS')+DEFAULT 
C3) R+FS OFMT VALUES © FS¢'LF',(¥3+p¥VALUES+2>pR),'.0' 
[3] R+(FS OFMTi\VALUES),R © +0 
[4] DEFAULT:R«¥ VALUES 
y 


can be called in any of the following ways: 


AFMT A 

SINT A BL 

A AFMT B 

A AFMT B C D 


or even 
ABC AFMT CDE 


if OFMT were extended to accept a nested left 
argument. 

Beginning APL’ers often stumble when they try to 
generalize the natural concept of a constant numeric 
vector: 


Re OQ Geko Ver Rae 
2%. pV 
a oo 
a. &. 8.8 
oo py 
SYNTAX ERROR ? 


(2UCi oy 
SYNTAX ERROR ie 
(R-C pY 
A 


Without catenation, they’re lost. With strand notation, 
the concept extends gracefully: 


hf oF 
Ss 
tie fee 


Figure 1 on page 7 shows some examples that il- 
lustrate the progress we’ve made. 


APL objects are 
dynamically bound; that is, 
their definition (variable or 
function, monadic or 
dyadic, etc.) is interpreted 
only at execution time. 
One consequence of this 
dynamic binding is that a 
statement may have 
several interpretations 
when viewed in isolation. 
For example, in 


a F-G 


the only things we know for sure (assuming all ob- 
jects execute without error) is that # is a variable or 
niladic function, and that £ cannot be dyadic. Try to plicit results. a 
anticipate the next issue by listing all possible inter- 


Old Arcane Notation 


C+5+B<+3+A+1 A+1 x B<u x C9 
R<«T,0pL+R,0pT*+L xx T<+L x L+R x R<T 
R+1+t+(pMAT),C+1+pMAT R+1t+oMAT x C+1+pMAT 


+(2 FITT AFGI fes] >+(Z6,28,L3)[2+xS] 


** Works for vectors and zero-row matrices only. 


Figure 1 
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Modern APL Notation 


A S G¥1 4 3 


NARS Strand Notation 


>L6 L8 L3C2+xS] 


pretations of the above statement. You can assume 
that all four objects are variables or functions with ex- 
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